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PHOLASIN 



PCT/GB99/03654 



The present invention relates to a protein, capable of bioluminescence, cDNA coding 
therefor and their uses, inter alia, in diagnostics and therapy. In particular, this invention 
5 relates to the cloning and sequencing of cDNA coding for pholasin from the bivalve 
mollusc Pholas dactylus. 

The term 'bioluminescence' refers to the emission of light resulting from a chemical 
reaction within, or produced by, a living organism. The essential components to the 
10 chemical reaction are: an organic molecule, usually comprising a luciferin; oxygen or one 
of its metabolites; and an enzyme or luciferase that catalyses the oxidation of the luciferin. 
The chemiluminescent reaction responsible for bioluminescence may be represented as 
follows: 

15 Luciferin + 0 2 , or 0 2 , or H 2 0 2 , or OH or OCT or OX" or l 0 2 (+ luciferase) > 

oxyluciferin + light. 

Up to three other substances may also be required to generate light or to generate light of 
the required colour and intensity. These are as follows: 

20 

(a) A cation, such as H\ Ca 2 \ Mg 2 * or a transition metal cation (eg Cu7Cu 2 ^, 
Fe 2 7Fe 3 \ La 3+ and V 3+ ); 

(b) A co-factor such as NAD(P)H, FMN or ATP; and/or 

(c) A fluor as an energy transfer acceptor. 

25 

Five chemical families of luciferin are known: 

(a) Aldehydes (found in the freshwater limpet Latia, earthworms, and with FMN in 
bacteria); 

30 (b) Imidazolopyrazines, which are the compounds most commonly responsible for 
bioluminescence in the sea (found in Sarcomastigophora, Cnidaria, Ctenophora, 
Annelida, Chaetognatha, some Arthropoda, some Mollusca and some Chordata); 
(c) Benzothiazoles (found in beetles such as fireflies and glow-worms); 
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(d) Linear tetrapyrroles (found in dinoflagellates, euphausiid shrimp and some fish); 
and 

(e) Flavins (found in bacteria, fungi, polychaete worms and some molluscs). 



5 Chemiluminescent reactions involving these luciferins may produce a glow or a flash with 
an emission of violet, blue, blue-green, green, yellow, orange or red light, or occasionally 
UV or IR light. The light emission may be linearly or circularly polarised. The luciferin 
or its product may also be detected and quantified by fluorescence or phosphorescence. 
As a chemical reaction is directly responsible for the light emission, it does not require 
10 exposure to UV, visible or IR light. However, some bioluminescent systems, such as that 
in the red organ of the deep sea fish Malacosteus, exhibit a photo-chemiluminescence, 
where light can trigger or enhance the chemiluminescent reaction. [Reference is directed 
to Chemiluminescence: Principles and Applications in Biology and Medicine, A K 
Campbell (1988), Horwood/VCH Chichester, Weinheim.] 

15 

In the case of some bioluminescent proteins, the luciferin is so tightly or covalently bound 
to the protein molecule that it does not diffuse away into the surrounding fluid as a result 
of the chemiluminescent reaction. In this case, the protein-Iuciferin complex is known as 
a photoprotein; and the protein itself is referred to as an apophotoprotein. Some 
20 bioluminescent proteins are proteins whose light emission or radiation depends on or may 
be altered by oxygen or one of its metabolites; these bioluminescent proteins are 
hereinafter referred to as 'bioluminescent oxidative indicator proteins' (BOIPs).BOIPS 
may thus be photoproteins or luciferin-luciferase systems. 

25 BOIPs, therefore, may be used to detect and quantify oxygen or one of its metabolites in 
individual cells, defined compartments of living cells such as the nucleus, whole organs 
and organisms - both animals and plants, including microbes such as viruses and bacteria 
and protozoa - as well as substances of biological interest such as substrates, metabolites, 
vitamins, drugs, intra- and extra-cellular signals, enzymes, antigens, antibodies and 

30 nucleic acids. Heretofore, it has only been known to employ native BOIPs 
extracellularly. 

The present invention therefore relates to a method for the detection and/or measurement 
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of oxygen or one of its metabolites in live cells (intracellular), which method comprises 
providing a BOIP, such as native or chemically- (or genetically-) modified BOIP or a 
'rainbow protein' based on such a BOIP, intracellularly and thereafter detecting and/or 
quantifying light emission therefrom and/or changes in colour, intensity and/or 
5 polarisation of emission(s) therefrom. 

Furthermore, it has now been found that, by sequencing the BOIP and identifying the 
cDNA encoding therefor, the recombinant BOIP can also be used in such a method, or 
chemically- or genetically-modified recombinant BOIP, or a 'rainbow protein' based on 

10 such a BOIP. For example, the bivalve mollusc Pholas dactylus has been shown to 
comprise a native photoprotein, which interacts with a luciferase, when they are secreted 
together by the mollusc to produce light when 0 2 or one of its metabolites is present. 
References to the Purification and Properties of Pholas Dactylus Luciferin and Luciferase 
can be found by Michelson in Methods in Enzymology LVII 385-406 (1978). References 

15 to detection of activation of neutrophils by detection of superoxide anion can be found by 
Roberts in Anal Biochem 160 139-148 (1987) and by Muller et al in J Biolum Chemilum 
3 105-113 (1989). The native photoprotein (known as pholasin) is made up of a 
glycosylated apoprotein (34kDa) with a small organic molecule, the luciferin, tightly 
bound to it. This luciferin (whose structure is unknown - Muller and Campbell in J 

20 Biolum Chemilum 5 25-30 (1990)) can be extracted from the protein moiety - the 
apopholasin - or from the organism by a standard treatment, such as mild acid. The 
pholasin may be collected from live molluscs found in sedimentary rocks at low water 
along the south coast of England from Plymouth to Folkestone and also along the French 
channel coast and in the Mediterranean. Further details may be obtained from marine 

25 fauna and the references cited herein. 



We have surprisingly found that pholasin can generate light even without the presence of 
the corresponding luciferase by addition of oxygen metabolites such as 0 2 H 2 0 2 , OC1- 
or other oxyhalide anions, or organic peroxides, and certain organic solvents such as 
30 dimethyl sulphide (DMSO) or dimethyl formamide (DMF). 

We have now identified the cDNA encoding for the (non-glycosylated) apoprotein of 
pholasin, which may also be called 'apopholasin 1 . Accordingly, the present invention 
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therefore further provides an isolated, purified or recombinant nucleic acid sequence 
comprising: 



(a) The apophotoprotein of phoiasin (alternatively, 'apopholasin'); 
5 (b) A sequence substantially homologous to or that hybridises to sequence (a) under 
stringent conditions; or 

(c) A sequence substantially homologous to or that hybridises under stringent 
conditions to the sequence (a) or (b) but for the degeneracy of the genetic code; or 

(d) An oligonucleotide specific for any of the sequences (a), (b) or (c). 

10 

The present invention will now be further described with reference to the accompanying 
Figures, in which: 

Figure 1 shows three different cDNAs encoding apopholasin, referred to as clones 40, 3 
15 and 5. Nucleotides in bold type show codons used for initiation and termination of 
translation; 

Figure 2 shows the three sequences of Figure 1 aligned to demonstrate the sequence 
similarity. This figure was generated by Clustal. Positions which are indicated with a 
20 star are identical in all three clones. The codons for the initiation and termination of 
translation are highlighted in bold; 

Figure 3 shows the oligonucleotides used for the complete sequencing of the positive 
clones. These were identified from the cDNA library; their positions in clone 40 are 
25 shown. Oligonucleotides are shown in bold type, portions of the flanking sequence of the 
Bluescript plasmid are shown in italic type; 

Figure 4 describes the protein sequence described by the DNA sequence coding for 
apopholasin and shows, in Figure 4A, the complete sequence of the positive clone 40 
30 identified from the Pholas dactylus light organ library. The first 20 amino acids at the N- 
terminus are a signal peptide, and this can either be retained or removed when generating 
the BIOP as described in this invention and, in Figure 4B, the cDNA coding for 
apopholasin with untranslated 5 ' and 3 ' ends. The untranslated regions are also shown; 
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Figure 5 describes the protein sequence for pholasin with (5B) and without (5 A) the 
signal peptide; 

5 Figure 6 shows the sequence for apopholasin genomic DNA, Two gDNA clones were 
indentifed but no introns were found; the Figure shows an alignment of the cDNA from 
cDNA clone 40 and the gDNA amplified by both rTth DNA polymerase XL and BioXAct 
polymerase. The sequences of the PCR product and the inserts in pGEM T were aligned 
with the sequence of the cDNA of clone 40 and were identical to this cDNA; 

10 Figure 7 describes the oligonucleotides used for screening and expression. Degenerate 
oligonucleotides for library screening are shown in Figure 7A; non-degenerate ones in 
Figure 7B; and oligonucleotides used for protein expression are shown in figure 7C; 
Figure 8 lists the main restriction sites in the DNA for engineering pholasin; 
Figure 9 is a schematic representation of Figure 8 mapped to the sequence of Figure 4A 

15 (translated region), and 

Figure 10 is a time course of hypo-chlorite triggered chemiluminescence. 

Accordingly, the present invention provides recombinant DNA encoding the 
apophotoprotein apopholasin and comprising the nucleotide sequence of the sequence 
20 disclosed in Figure 4B. Three different cDNAs coding for apopholasin have been 
isolated, having differing non-coding regions, respectively disclosed in Figure 1. The 
genomic DNA (gDNA), which contains no introns, has been shown (Figure 6) to 
comprise the same basic sequence as the cDNA. 

Pholasin is a glycoprotein having 11.1 glusamine, 9.8 fructose, 7.1 mannose and 5.2 
25 galactose residues. The cDNA for apopholasin has a molecular weight of 23,456 
compared to 34,600 of the pholasin extracted from Pholas, The difference in the 
molecular weights of native versus recombinant apopholasin is due to the glycosylation of 
the native protein and luciferin. The isoelectric point of the translated protein calculated 
by the ISOELECTRIC command of the GCG programme is at 3.84. The native protein 
30 has a lower isoelectric point (<3.5), probably due to the presence of bound sulphate. 

The three clones (Figure 2) isolated from the library encode a unique protein (Figures 4 
and 5), which does not have the same amino acid sequence as any known protein in the 
SwissProt database. The present invention therefore not only provides cDNA and RNA 
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coding for the protein, but also the recombinant protein per se, with or without 
glycosylation units. A comparison of segments of the pholasin protein sequence with the 
proteins in the SwissProt database identified several proteins with regions having a high 
sequence similarity to regions of the cloned protein. These included several proteins 
which interact with nucleotides (Table 1). 

Table 1 A comparison of sections of the sequence of the cloned protein with sections 

of proteins which interact with nucleotides. 



Protein 


Homologous region 
cloned protein 

homology (+ denotes a conserved amino acid) 
selected protein 


tRNA-splicing endonuclease p 
subunit 

Saccharomyces cerevisiae 
EC 3.1.27.9 


SLYDEDNNGVMDEGKVIPSETIE 
+L DEDNN + + G ++P E++E 
NLRDEDNNLLDENGDLLPLESLE 

LDQDVELDYTW 
LD DV DYTW 
LDHDVSKDYTW 


ATP-AMP transphosphorylase 
Cyprinus carpio 
EC 2.7,4.3 


VMDEGKVIPSETIEDDIKDCGLLDQDVELDY 
+M +G+++P +T+ D IKD + DV Y 
IMQKGELVPLDTVLDMIKDAMIAKADVSKGY 


DNA primase 
Synechocystis sp. 
EC 2.7.7. - 


E EVQCAMNWT QAN EYVFNVD 
++VQ M ++Q+ + +FN D 
DQVQSLMRFSQSKQI IFNFD 


purine permease 
Emericella nidulans 


VQCAMNWTQANEYV 
+ C+++WT+ N ++ 
IMCSVDWTRRNRFI 


DNA repair protein complementing 
XP-A cells homologue 
Drosophila melanogaster 


PDTVDEAEDTPSET 
PDT DE EDT + T 
PDTYDEEEDTYTHT 


ATP synthase (3 chain 
Peptococcus niger 
EC 3.6.1.34 


DTVDEAEDTPSET 
D +DEA + PSET 
DPI DEAGEVPSET 


DNA polymerase a 
Homo sapiens 
EC 2.7.7.7 


DEDNNGVMDEGKVIPSETIEDDIKD 
D+D G +++G+ I + +EDD D 
DDDGIGYVEDGREIFDDDLEDDALD 



Similarity was found between the Vargula luciferase and Renilla LBP, but no other 
bioluminescent protein. 
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Sequence homology between the cloned protein and (a) Vargula luciferase (b) Renilla 
LBP. An area of high homology in all three proteins is in bold print. 



(a) 148 206 
GTIWTVRVSLYDEDNNGVMDEGKVIPSETIEDDIKDCGLLD-QDVELDYTWTQNECDL 

V+VSL D+ + + T+DID + V++ + + 

YWNTWDVKVSLRDVESYTEVEKVTIRKQSTWDLIVDGKQVKVGGVDVSIPYSSENTSI 

10 353 412 
(b) 

105 166 

STMPGTYMLMDVCATRDADDKCIEGTIWTVRVSLYDEDNNG^DEGKVIPSETIEDDIKDC 

+ TR + VR+S+ + N+ K I 

15 AIKIAKLSAEKAEETRGFLRVADQLGLAPGVRISVEEAAVNATDSLLKMKAEEKAMAVIQSL 

41 1*>4 

Three potential glycosylation sites on the protein have the consensus triplet sequence Asn- 
Xaa-Ser/Thr (where Xaa can be any residue except proline). Thr 216 was identified as a 
20 potential site of O-linked glycosylation by a neural network which has been trained to 
identify this type of glycosylation. The amino acid sequence was also entered into a neural 
network which had been trained to identify eukaryotic signal peptides. This confirmed that 
the most likely cleavage site is between positions 20 and 21 (GSG-EE). 

25 Many families of proteins contain a "signature" sequence of amino acids. The sequence of 
the clones did not contain any of these signatures present in the PROSITE database. The 
amino acids from 170 to 185 correspond to the calcium binding consensus sequence 
[DENQST]X[DENQST]X[DENQST]X[DENQST]X[DENQST]XX[DENQST]. Thirteen 
potential phosphorylation sites were discovered that matched the consensus sequences for 

30 either the kinase phosphorylation site [RK](2)-x-[ST], the protein kinase C phosphorylation 
site [ST]-x[RK] or the casein kinase II phosphorylation site [ST]-x(2)-[DE]. 



Three N-linked glycosylation sites were identified in the translated sequence of the clones A 
neural network has been trained to identify this type of glycosylation which identified Thr 
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216 as a potential site of O-linked glycosylation. At least one of these sites must be 
glycosylated in the native protein in order to account for the presence of the sugar residues. 
A putative signal peptide region preceded the N terminus of the secreted protein 
(determined by amino acid sequencing and was identified as a signal peptide by a neural 
5 network). To confirm this result the protein sequence was searched with PSORT for motifs 
which would locate the cloned protein in a cellular compartment. The protein sequence did 
not contain any transmembrane regions or N-myristoylation patterns which would indicate 
the presence of a lipid anchor. No targeting or retention sequences were found for the 
nucleus, mitochondria, endoplasmic reticulum or peroxisome. 

10 

The fact that the clones had some sequence similarity with proteins that interact with 
nucleotides may suggest that pholasin binds a co-factor as part of the chemi luminescent 
reaction. Beetle luciferases require ATP binding for chemiiuminescent activity. There is no 
P-loop binding motif ((A,G)x4GK(S,T) or (A)x{4}GK(T)) in the amino acid sequence of 
15 these clones. However, not all ATP binding proteins contain this motif. Neither does the 
cloned protein contain the GXGXXG phosphate binding consensus sequence necessary for 
the binding of other co- factors such as nicotinamide adenine dinucleotide. 



The amino acid and sugar components of pholasin are not able to emit light at the 
20 wavelength of the native protein (490nm). This indicates that there must be a chromophore 
bound to the protein. There are, however, proteins in which the chromophore is composed 
of modified amino acid residues within the polypeptide. The best characterised of these is 
the green fluorescent protein (GFP). This has a chromophore which is a ring formed by the 
autocatalytic cyclisation of the residues Ser-dehydroTyr-Gly. The serine may be mutated to 
25 a threonine, which increases the amplitude of the emission at 488nm. Pholasin had no 
similar amino acid sequence. Putative luciferin binding regions have been identified for two 
bioluminescent chemistries. Aequorin has a putative coelenterazine binding region, which 
is also present in two sections of the Vargula hilgendorfli luciferase. The sequence of the 
cloned protein has no homology to the putative luciferin binding site of aequorin, but the 
30 region of the Vargula luciferase from residue 353 to 411 has some similarity, as does the 
LBP of Renilla reniformis, which also binds an imidazolopyrazine. This may indicate that 
the chemistry of pholasin bioluminescence involves an imidazolopyrazine luciferin. 
However, the region of homology is very small. The beetle luciferases contain an area of 
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low sequence homology which may bind the benzothiazole luciferin. This low homology 
may account for the different colours of beetle bioiuminescence. used a luciferin analogue 
(2-(4-benzoylphenyl) thiazole-4-carboxylic acid which photoinactivated the luciferase active 
site of the firefly Photinus pyralis. This photoinactivation was directly linked to the 

5 degradation of a small peptide sequence HHGF (residues 244-257). This is therefore 
postulated as a luciferin-binding site. The cloned protein does not have any sequence 
homology with these putative binding regions. Two strongly conserved regions of amino 
acids have also been found in both the luciferase and the luciferin binding protein of the 
dinoflagellate Gonyaulax polyedra. These regions were compared to the cloned protein, but 

10 no sequence similarity was found- No sequence identity could be established between the 
bacterial luciferases and the cloned protein. 

Therefore, the present invention provides cloned apophotoprotein apopholasin (and the 
cDNA coding therefor), which has identical properties to native (but non-glycosylated) 
15 apopholasin with respect to molecular weight, amino acid composition, potential for 
glycosylation, its highly acidic pi and its cellular location. Hence, the present invention 
can further provide the corresponding BOIP or modified BOIP, according to standard 
methods. 

20 The corresponding BOIP is preparable by bringing the apophotoprotein pholasin into 
association with the luciferin, also using standard methods. Although the luciferin is 
tightly bound in the native pholasin BOIP, it has been found that it may not be the case in 
the recombinant pholasin BOIP; indeed the luciferin may be weakly bound or merely 
present with the apoprotein. For example, a methanol, aqueous, acidic or other extract of 

25 Pholas dactylus (whole organism or light organ dissected from the animal) containing the 
* luciferin', or the pure luciferin, may be added to the solution, cell or organism (Figure 10 
shows the time course of hypo-chlorite triggered luminescence in these circumstances). A 
time course of apopholasin reactivation was performed by incubating partially purified 
recombinant pholasin secreted by insect cells with acid:menthanol extract of Pholas 

30 dactylus (•) for 0 (solid line), 1 (---), 2 ( ), 6( ) or 24 hours ( ), or incubated 

without extract (o). Controls of buffers only with no protein or acid:methanol extract ( ) 

and extract alone (■) were treated in identical conditions. Chemi luminescence was 
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triggered by the addtion of 2% soidum hypochlorite at 10 seconds (arrow) and is shown 
as chemiluminescent counts minus background light. A typical curve obtained by 

hypochlorite triggering of native pholasin is also shown (X). A representative experiment 
carried out in duplicate is shown. 

5 

The luciferin associates with the apo-BOIP forming the photoprotein or remains loosely 
bound so that it turns over like a luciferase.. The luciferin on the photoprotein then reacts 
with oxygen or one of its metabolites to produce light, in the presence or absence of the 
luciferase. The light emission may be detected, quantified, or imaged using a 
10 luminometer, photographic film or imaging camera, or by the naked eye. Alternatively, 
light emission may be generated spontaneously by intra- or extra-cellular metabolites 
reacting with the apo-BOIP. 

Although illustrated with respect to pholasin, the following may apply to any BOIP: the 
15 BOIP can be produced directly from native DNA, or from DNA engineered or amplified 
by the polymerase chain reaction. By this means, sites can be inserted within the protein 
by splitting the DNA into two or more pieces, or by adding DNA sequences to the 5' or 3' 
ends. For example, the DNA may be expressed in bacteria, yeast, an insect or human cell, 
or other suitable organism to produce protein which can be extracted and used. 

20 

In this instance, the protein produced from the cloned DNA reacts with oxygen or a 
metabolite of oxygen, such as the superoxide anion (O/), hydrogen peroxide (H 2 0,), 
hydroxy radical (OH'), an oxyhalide anion (OCT, OBr , OI\ OSCN ), nitric oxide (NO), an 
organic hydroperoxide or a radical ROO . The change in light emission enables the 

25 oxygen or metabolite(s) to be detected and quantified in live cells, organelles, or on the 
outer or inner surface of the plasma membrane, or within an organ of a live organism 
without the need to break them open or the need to separate bound and free fractions. 
This also enables an enzyme producing oxygen or one of its metabolites, such as 
chlorophyll, or enzymes such as oxidases and oxygenases which react directly with 

30 oxygen or one of its metabolites to attach oxygen to the substrate to be detected and 
quantified in live cells, organs and whole organisms, or extracts from any one of these. 

Also the BOIP can be made in vitro by transcription/translation in a cell lysate such as 
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rabbit reticulocyte lysate or wheat germ extract containing RNA polymerase. The DNA 

for the BOIP is first engineered to contain an RNA polymerase promoter, such as T7, 

SP6; bacterial promoter(s), such as lac, ara or trp; or mammalian promoter(s), such as 

actin, myosin, myelin proteins, TK, MRT-V, SV40, CMV, RSV, metallothionine, 

5 antibody, G6P dehydrogenase, and can be amplified in vitro using the polymerase chain 
reaction. A poly-A tail may be added at the 3' end and a tissue specific promoter or 
enhancer sequence added to the 5' or 3' end of the DNA coding for the BOIP or modified 
BOIP, enabling it to be expressed specifically in a target cell, such as a myocardial cell or 
a cancer cell. The expression of the BOIP in the target cell is detected and quantified by 

1 0 light intensity, colour or polarisation, as previously mentioned. 



The BOIP, or its DNA or RNA, may be incorporated into a live bacteria or eukaryotic cell 
using phage, virus, plasmid, calcium phosphate transfection, electroporation, liposome 
fusion, membrane pore forming proteins, micro-injection or DNA gun. Once inside cells 

15 or an appropriate extracellular environment, cell activation or injury will initiate or 
change the light emission from the BOIP. For example, expression in live organisms by 
micro-injection of protein, RNA or DNA, or by transgenic manipulation to produce a cell, 
such as a bacterial, microbial, animal or plant cell, eg a white blood cell, a heart cell, or a 
yeast, protozoan, fruit fly (Drosophila), nematode worm, polychaete worm, fish, human, 

20 mouse, rat, sheep, pig, horse or plant, which can generate its own light. 



The BOIP can also be incorporated into a defined part of a live cell by chemical means or 
by genetically engineering the BOIP to contain a signal peptide, which locates the BOIP 

25 to the inner or outer surface of the plasma membrane or within a particular organelle, such 
as peroxisome, mitochondrion, chloroplast, tonoplast, endoplasmic, reticulum, Golgi 
apparatus, endosome, lysosome, secretory vesicle, nucleus, nucleolus, nuclear membrane, 
plasma membrane, proteosome, or gap junction, or structure such as membrane receptor 
ion channel microtubule, cytoskeleton, nuclear skeleton, nuclear receptor, mitotic spindle 

30 or microfilaments. The signal peptide, added either chemically or genetically, will 
normally target the normal or modified BOIP to a particular intra- or extra-cellular site. 
For example, the sequence MLSRLSLRLLSRYLL or part of cytochrome oxidase on the 
N-terminus will target the BOIP to the mitochondrion; KKSALLALMYVCPGKADKE 
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or MLLPVPLLLGLLGLAA at the N-terminus will target the BOIP to the endoplasmic 

reticulum, a KDEL or HDEL or KEEL sequence at the C-terminus retaining it there. SKL 

at C-terminus targets BOIP to the peroxisome; PKKKRKV or an extension of this SV40 

large T-antigen signal will target it to the nucleus; and a palmitoylation and/or a 

5 myristoylation signal will target it to the plasma membrane. By coupling the BOIP to 
another protein that targets itself to a particular site, the BOIP can also be targeted there. 
For example, coupling the nuclear proteins nucleoplasmin or lamin B receptor to BOIP 
targets it to the nucleus; cytochrome oxidase at the N-terminus targets BOIP to the 
mitochondria; chlorophyll at the N-terminus targets BOIP to the chloroplast; a connexin 

10 at the N-terminus targets BOIP to the gap junction or plasma membrane; and SNAP 25 to 
the plasma membrane. 

Other modifications to the apoprotein, BOIP, or nucleotides coding therefor include, but 
are not limited to: 

15 

The apoprotein, such as apopholasin, may also be glycosylated, and used to detect and 
quantify secretion or movement of proteins through the secretory pathway. 

Nucleic acid coding for the BOIP when expressed inside a live cell may not only be 
20 modified but also regulated in this cell by gene expression, such as by promoters, 
enhancers or oncogenes. For example, the apoprotein, such as apopholasin, may be 
coupled to a gene regulator protein, such as a transcription factor, by genetic or chemical 
manipulation, such that the movement through a cell or of the regulator protein or its 
activity, can be detected or quantified. 

25 

The BOIP or apoprotein, or its DNA may be linked to another protein or DNA used in 
therapy, such that the other protein or DNA can be detected in live cells or in a whole 
organism, eg a human. 

30 The apoprotein, such as apopholasin, can also engineered genetically or chemically to 
contain a site or sites which can be covalently modified by enzymes such as 
phosphorylation (including ser/thr, his and tyr kinases and phosphatases), 
tranglutamination, proteolysis, ADP ribosylation, gly-or glu-cosylation, halogenation, 
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oxidation, methylation, palmitoylation, myristylation and farnesylation. 



The apoprotein, such as apopholasin, can be engineered genetically or chemically to 
contain an antigen or intracellular signal binding site, such as Ca 2+ , cyclic AMP, cyclic 
5 GMP, cyclic CMP, IP 3 , IP 4 , diacyl glycerol, ATP, ADP, AMP, GTP, or any oxy- or 
deoxy-rH^pnucleoside or nucleotide, a substrate, a drug, a nucleic and/or a gene regulator 
protein. 

The BOIP may also be converted to a rainbow protein by engineering a particular site 
10 such as described hereinabove into the BOIP, at the N- or C-terminus, or between a 
chimera of the BOIP and an energy transfer acceptor, such as GFP (wild type or any of 
the mutant GFPs). This is known as chemiluminescence, bioluminescence or 
fluorescence resonance transfer (CRET, BRET or FRET, respectively). Conversion of the 
BOIP to a 'rainbow protein' may be effected by reaction with a cellular substance, 
15 modification genetically or chemically, or by linking the BOIP to a fluor, such as the 
green fluorescent protein or the red fluorescent protein in the deep sea fish Malacosteus. 
The result is a BOIP which changes its colour and/or intensity and/or polarisation of 
emission. The change in colour occurs by energy transfer, eg resonance transfer (CRET 
or FRET) or electron transfer. 

20 

The initial (unmodified) BOIP may be the apophotoprotein, its DNA or RNA, from the 
bivalve mollusc Pholas dactylus, or it may be another BOIP, such as one from the 
mollusc Rocellaria grandis or the squid Ommastraphes, or earthworm luciferase, which 
produce light with oxygen metabolites in a way very similar to Pholas dactylus. 

25 

The BOIP, apo-BOIP, or nucleic acid coding for it, whether modified or not, may 
therefore be used in a range of biology and investigations such as: 

(a) Detection, location and measurement of signals in substrates, such as live cells, 
30 organs or organisms, or in extracellular fluids; 

(b) Detection, location and measurement of oxygen and its metabolites in substrates, 
such as live cells, organs or organisms, or in extracellular fluids, water (sea and 
fresh), soil or the atmosphere; 
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(c) Detection and location of normal cells such as microbes (protozoa, yeast, fungi, 
moulds, bacteria, viruses); 

(d) Detection and location of abnormal cells, such as cancer cells, hyperactive cells in 
rheumatoid arthritis and other inflammatory diseases, cells infected with a 

5 pathogen, such as a virus or other infectious agents, cells damaged by physical, 

chemical or biological attack, cells damaged by perfusion or reperfusion injury or 
cells damaged by oxygen or one of its metabolites; 

(e) Measurement and location of enzymes, particularly those producing oxygen or its 
metabolites, and other tumour reactions in cells or biological fluids; 

1 0 (f) DNA and RNA binding assays; 

(g) Immunoassay and other protein binding assays; 

(h) In genetic engineering, in the development of transgenic animals and plants, and 
microbes; in horticulture; agriculture; medicine and veterinary medicine; and/or 

(i) in genetic entertainment by incorporation into light sticks, greeting cards or toys to 
1 5 produce light of various colour, intensities, oscillations, flashes and glows; or in 

comestibles, such as food, drinks, including beers, wines, spirits, colas and other 
soft drinks. 



20 Accordingly, the present invention further provides an apoprotein, such as pholasin 
apoprotein (or apopholasin) in both unglycosylated and glycosylated forms, and a BOIP 
thereof, such as pholasin, either alone (but excluding native proteins per se that have 
already been isolated, such as native pholasin per se) or in association with one or more 
of: a targeting or signal peptide; a glycosylate; a site capable of modification by an 

25 enzyme; an antigen or intracellular signal binding site; a promoter, an enhancer or an 
oncogene or a pharmacologically active substance; or the like. The present invention 
further provides a recombinant construct comprising a nucleic acid sequence encoding for 
any of these proteins; a vector containing a nucleic acid sequence encoding for any of 
these proteins; a host transformed by such vector; a live cell, such as bacterial, insect, 

30 eukaryotic, prokaryotic, archae or plant cells containing or expressing any of these 
proteins; and a rainbow protein, as described herein, together with a nucleic acid 
sequence encoding therefor. 
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The present invention will now be illustrated with reference to the following non-limiting 
examples, in which the methodology referred to is known to those skilled in the art and/or 
may be carried out by analogy with reference to the protocols disclosed in the following 
references, the contents of which are herein incorporated by reference in their entirety: 
5 BOOKS 

1. Campbell,AK. (1988). Chemiluminescence: principles and applications in biology 
and medicine, pp608. Horwood/VCH, Chichester and Weinheim. 

2. Campbell,AK. (1994). Rubicon: the fifth dimension of biology, pp 304. Duckworth, 
London. 

10 PAPERS 

1. Campbell,AK Patel,A. (1983) Biochem J. 216:185-194. A homogeneous 
immunoassay for cyclic nucleotides based on chemiluminescence energy transfer. 

2. Roberts,PA KnightJ Campbell,AK. (1987) Anal. Biochem. 160:139-148. Pholasin - a 
bioluminescent indicator for detecting activation of single neutrophils. 

15 3. Mueller,T Davies,EV Campbell,AK. (1989) J. Biolum. Chemilum. 3:105-113. 
Pholasin Chemiluminescence Detects Mostly Superoxide Anion Released from 
Activated Human Neutrophils. 

4. Mueller,T Campbell,AK. (1989) J. Biolum. Chemilum. 5:25-30. The Chromophore of 
Pholasin: A Highly Luinescent Protein. 

20 5. Sala-Newby,G arid Campbell,AK (1991) Biochem. J. 279:727-732. Engineering a 
bioluminescent indicator for cyclic AMP dependent protein kinase . 

6. Campbell, AK, Trewavas, AJ and Knight, MR (1996). Calcium imaging shows 
differential sensitivity to cooling and communication in luminous transgenic plants. 
Cell Calcium 19: 211-218. 

25 7. Kendall, JM, Sala-Newby, G Badminton, M, Campbell AK and Rembold, CR (1996). 

Free Ca 2+ in the endoplasmic reticulum of living cells measured using aequorin as a 
pseudo luciferase. Biochem.J 318:383-387 

8. Badminton, MN, Campbell, AK and Rembold, CR (1996). J.Biol. Chem. 271:31210- 
31214. Differential regulation of nuclear and cytosolic Ca 2+ in HeLa cells. 

30 9. Sala-Newby, GB, Taylor, KT, Badminton, MN, Rembold, CR and Campbell, AK 
(1998). Imaging bioluminescent indicators shows Ca 2+ and ATP permeability 
thresholds in, live cells attacked by complement. Immunology. 93:4:601-609 

lO.Sala-Newby, GB, Kendall, JM, Jones, H, Taylor, KM, Badminton, MN, Llewellyn, 
DH and Campbell, AK (1999). Targeting bioluminescent proteins to defined 
35 compartments fo living cells. Methods in Enzymology Bio luminescence and 
Chemiluminescence. ed Ziegler, M and Baldwin, T. 
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c or genomic DNA coding for apopholasin, with or without the cDNA coding for the signal 
peptide, is amplified by PCR with restriction sites such as BamHl at each end. The cDNA 
5 is run on an agarose gel and the full length DNA eluted and purified. The DNA is then cut 
with BamHl to generate sticky ends and ligated into an expression plasmid such as pET3a, 
which has been cut with BamHl also. After ligation the sealed plasmid is transformed into a 
standard E.coli K12 strain such as JM109, a colony picked off for a large plasmid 
preparation. After checking that the plasmid contains the correct sequence for apopholasin 

10 and is in the correct orientation the plasmid is then used to transform a standard expression 
strain of E.coli such as BL21(DE3) or other expression strain. A colony is picked off the 
agar plate and grown up for 2h in standard LB broth. IPTG is added as inducer for a further 
2h. Apopholasin can then be extracted by breaking the bacteria by lysozyme digestion or 
sonication in a standard salt medium such as 50mM HEPES pH 7 +/- ImM ascorbate. Since 

15 the apopholasin is unglycosylated it tends to aggregate and form inclusion bodies. These 
can be broken using 8M urea or guanidmium chloride and these then dialysed off. If the pH 
of PAGE gels is alkaline this also tends to allow aggregation of both the unglycosylated and 
glycosylated apo- and full pholasin. A signal peptide such P-lactamase signal will target the 
BOIP to the periplasmic space, resulting in the ability to secrete the expressed protein from 

20 the external fluid of the cells. 

EXAMPLE 2: Production of a BOIP in insect cells 

c or genomic DNA coding for apopholasin is inserted into a plasmid suitable for 
25 conversion into baculovirus when transfected into insect cells. Since pholasin is secreted 
by Pholas itself there is a signal peptide at the N-terminus. Removal of this by PCR will 
allow cytosolic expression in insect cells. If the signal peptide is left on or changed for 
honey bee mellitin signal peptide, the apopholasin is secreted into the external medium. 
The virus containing the DNA for apopholasin is then purified and stored until required. 
30 An aliquot is then added to fresh insect cells and these incubated for 3-7 days. The 
apopholasin is then isolated from the supernatant if a signal peptide is used, or from the 
cells is not. The apopholasin can then be purified by ammonium sulphate precipitation, 
gel filtration and DEAE chromatography. The state of glycosylation can be assessed by 



nnOMe t7 PCT/GB99/03654 

WO 00/28025 1" 

running the protein on PAGE when the molecular weight is 34Kda. Removal of the 
glycosylation by enzymes returns the protein to the size of apopholasin 23.5Kda. It can 
be stored frozen or freeze dried, and activated to form pholasin by addition of luciferin as 
described in Example 3. 

5 

Since the apopholasin tends to aggregate in the insect supernatant it is important to get the 
protein into non-aggregating buffer, e.g. 50mM HEPES pH 6, l-10mM ascorbate, as soon 
as possible. 

10 Formation of pholasin can then be achieved as described in Example 3, 
EXAMPLE 3: Generating pholasin and light emission 

To generate light the apopholasin must first be converted into pholasin with the luciferin. 

15 The luciferin can be extracted from native pholasin by mild acid, or by methanol, mild 
acid or alkaline treatment of light organs isolated from Pholas dactylus or the whole 
organism. After homogenisation the extract is centrifiiged or filtered to remove 
particulate material. Further purification can be achieved by tic of hplc. The luciferin is 
best stored dry, but can be stored at -70°C. The intactness and concentration can be 

20 estimated by measuring the absorbance or fluorescence. The details are as follows: 

(a) Isolation of the luciferin 

Four protocols (1-4) have been developed to extract and isolate the luciferin 
responsible for light emission in pholasin. The luciferin is a small organic moiety 
25 tightly bound to apopholasin when pholasin is isolated from Pholas dactylus, but 

also can be found not bound to apopholasin. Thus the extraction procedure isolates 
either form of luciferin. 

1. The organism Pholas dactylus or its light organs are homogenised in 
50mM sodium phosphate pH 6.0 on ice. The pholasin is precipitated with 
30 saturated ammonium sulphate (4°C stirred), and then removed by 

centrifugation at ca 15,000g for 30min in the cold. The supernatant is then 
passed down a SEP-PAK silica column, which binds the luciferin. The 
column is washed with 5ml ethyl acetate and then 5ml of methanol. The 
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active fractions containing the luciferin are assayed either by reactivation 
of the apopholasin or by chemiluminescence in DMSO, DMF, or NaOCl. 
The luciferin is concentrated and can be purified further on tic or hplc with 
a standard solvent. It is dried and stored at -70°C. 

The organism Pholas dactylus or its light organs are homogenised in cold 
acetone on ice, filtered through a Buchner funnel, and extracted with 
methanol: acetone (1:1), the residual powder being extracted 3 times with 
methanol and extracts combined. These are then concentrated in a 
Rotavaporator and left to stand for Ih on ice to allow further precipitation. 
The suspension is then refiltered and concentrated. The solution containing 
the luciferin is then passed down a SEP-PAK silica column which binds 
the luciferin. The column is washed with 5ml ethyl acetate and then 5ml of 
methanol. The active fractions containing the luciferin are assayed either 
by reactivation of the apopholasin or by chemiluminescence in DMSO, 
DMF, or NaOCl. The luciferin is concentrated and can be purified further 
on tic or hplc with a standard solvent. It can be dried and stored at -70°C. 
The organism Pholas dactylus or its light organs are homogenised in cold 
acetone on ice, and filtered through a Buchner funnel to give an acetone 
powder. This is then extracted with methanol :acetone (1:1), twice for 
lOmin and then 3 times with methanol. The extracts are combined and 
concentrated in a Rotavaporator. They are left to stand for lh on ice to 
allow further precipitation, refiltered and concentrated. The residual 
powder is resuspended in 50mM sodium phosphate pH 6.0, lOmM 
ascorbate, and ultrafiltered with a lOkD Amicon membrane at 4 C for 
pholasin. The solution containing the luciferin is then passed down a SEP- 
PAK silica column which binds the luciferin. The column is washed with 
5ml ethyl acetate and then 5ml of methanol. The active fractions 
containing the luciferin assayed either by reactivation of the apopholasin or 
by chemiluminescence in DMSO, DMF, or NaOCl. The luciferin is 
concentrated and can be purified further on tic or hplc with a standard 
solvent. It is dried and stored at -70°C. 

The organism Pholas dactylus or its light organs are homogenised in 
50mM HEPES buffer, with methanol and lOOmM HC1 on ice, and 
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incubated for 2h on ice. After centrifugation at ca 15,000g for 30min in the 
cold, the supernatant is then passed down a SEP-PAK silica column which 
binds the luciferin. The column is washed with 5ml ethyl acetate and then 
5ml of methanol. The active fractions containing the luciferin are assayed 
5 either by reactivation of the apopholasin or by chemiluminescence in 

DMSO, DMF, or NaOCl. The luciferin is concentrated and can be purified 
further on tic with a standard solvent. It is dried and stored at -70°C. 
Method 4 normally generates most luciferin. The luciferin is characterised by its 
absorbance and fluorescence spectrum, and by its chemiluminescence with 
10 DMSO, NaOCl and apopholasin. 

(b) Generation of pholasin from apopholasin and the luciferin 

A small sample of the luciferin (l-10ul) is added to apopholasin in an appropriate 
buffer (50mM HEPES pH 6-7.5, +/- 0.1% gelatine, +/- 1-lOmM ascorbate, or 
500mM NaCl, lOmM TES, ImM EDTA, ImM mercaptoethanol pH 6-7.5). The 
15 mixture is incubated at room temperature for up to 24h, and the pholasin assayed 

by adding an oxygen metabolite, e.g. NaOCl, or luciferase to a sample. When 
apopholasin has been expressed in cells, the luciferin is added externally, 
microinjected into individual cells or added via liposomes to get the luciferin into 
the cell. 



20 



25 



Light is detected and quantified in a standard luminometer, imaging camera 
(intensified or CCD), or by a silicon chip. 

EXAMPLE 4: Production of a BOIP in vitro 



c or genomic DNA coding for apopholasin, with or without the signal peptide, is 
amplified by PCR with the 5' primer containing the DNA coding for T7 RNA 
polymerase. The DNA product is purified and precipitated. After dissolving in lOmM 
tris/lmMEDTA pH7, the DNA is added to a standard in vitro transcription/translation 
30 system such as rabbit reticulocyte lysate or wheat germ agglutinin and incubated at 30°C 
for 30-60min. The apopholasin can then be purified and activated to form pholasin as 
described in Example 3. 
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The BOIP can also be incorporated into a defined part of a live cell by chemical means or 
by genetically engineering the BOIP to contain a signal peptide which locates the BOIP to 
5 the inner or outer surface of the plasma membrane or within a particular organelle such as 
peroxisome, mitochondrion, chloroplast, tonoplast, endoplasmic reticulum, Golgi, 
endosome, lysosome, secretory vesicle, nucleus, nucleolus, proteosome, or gap junction, 
or structure such as microtubule, cytoskeleton, nuclear skeleton, nuclear receptor, or 
mitotic spindle. The signal peptide, added either chemically or genetically, will normally 

1 0 target the normal or altered BOIP to a particular intra- or extra-cellular site for example, 
the sequence MLSRLSLRLLSRYLL or part of cytochrome oxidase on the N-terminus 
will target the BOIP to the mitochondrion; KKSALLALMYVCPGKADKE or 
MLLPVPLLLGLLGLAA or the ER protein calreticulin at the N-terminus will target the 
BOIP to the endoplasmic reticulum, a KDEL or HDEL sequence at the C-terminus 

15 retaining it there. SKL at C-terminus targets BOIP to the peroxisome, PKKKRKV or an 
extension of this SV40 large T-antigen signal will target it to the nucleus, and a 
palmitoylation and/or a myristoylation signal (MGCVCSSNPD - the LCK N-terminal 
acylation motif from tyrosine kinase) will target it to the plasma membrane. By coupling 
the BOIP to another protein which targets itself to a particular site then the BOIP is also 

20 targeted here. For example, coupling the nuclear proteins nucleoplasmin or lamin B 
receptor to BOIP targets it to the nucleus; cytochrome oxidase at the N-terminus targets 
BOIP to the mitochondria; chlorophyll at the N-terminus targets BOIP to the chloroplast; 
and a connexin at the N-terminus targets BOIP to the gap junction or plasma membrane, 
SNAP 25 to the plasma membrane. 

25 

In order to target pholasin to defined sites in living cells, the DNA coding for these 
targeting sequences are added by using PCR. For cytosolic apopholasin the native signal 
peptide is removed and also the BOIP can be linked to larger proteins at the N- or C- 
terminus such as firefly luciferase or aequorin to prevent it getting into the nucleus. This 
30 also enables ATP and oxygen metabolites, or Ca 2+ and oxygen metabolites to be measured 
simultaneously in the same cells by intensity, colour or polarisation of the different 
bioluminescent indicators. A multiple bioluminescent indicator can also be engineered by 
PCR, or by using restriction enzyme sites, from the DNA coding for 3 or more 
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bioluminescent proteins. A simple screen of the transformed bacteria enables the multiple 
rainbow protein to be isolated with 2-3 colours or more. 

The DNA is then added to an in vitro transcription/translation system as described in 
5 Example 4 in the presence of the organelle to be targeted (e.g. microsomes for the 
endoplasmic reticulum, which glycosylate apopholasin). 

The new DNA can also be inserted into a plasmid by standard techniques, and 
transformed into bacteria or transfected or injected into eukaryotic cells such as HeLa or 



Addition of the luciferin as described in Example 3 allows formation of pholasin which 
can then be detected by light emission. Changes in oxygen metabolite production are then 
be detected by a luminometer or imaging camera when the cells are exposed to external 
15 oxygen metabolites, a change in oxygen concentration, addition of stimuli e.g. TNF, EGF, 
hormones or drugs, or attack by pathogens such as bacteria, viruses, complement, 
antibodies, toxins, and cells of the immune system. 

EXAMPLE 6: Engineering a covalent modification site into a BOIP 



(a) The site coding a protein kinase A (RRAS or kemptide), protein kinase C 
(MARCKS), MAP kinase, ERK, the ER - nuclear signalling kinase IRE IP or a 
phosphatase is added to the N- or C-terminus or inserted at various sites within the 
apopholasin by PCR and expressed as described in Examples 1-5. Pholasin is 



10 



COS. 
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25 



then generated by addition of the luciferin as described in Example 3, 



Addition of the catalytic subunit for protein kinase A, or activation via cyclic 
AMP inside cells, leads to phosphorylation or dephosphorylation of the modified 
pholasin and change in light emission (intensity, colour or polarisation). 



30 



A preliminary screen is necessary to select the appropriate proteins and to discard 
any which have lost all activity. 
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(b) The site coding a protease (thrombin, enterokinase, HIV protease, caspase) is 
added to the N- or C-terminus of the apopholasin by PCR or inserted at various 
sites within the protein, and expressed as described in Examples 1-5. Pholasin is 
then generated by addition of the luciferin as described in Example 3. 

5 

EXAMPLE 7: Engineering a BOIP into a "Rainbow Protein" 

cDNA coding for apopholasin is linked to another protein by using the cDNA coding for 
that protein. For example, wild type GFP, the S65T mutant of the green fluorescent 

1 0 protein, YGFP, or EGFP are linked to apopholasin by PCR at the N- or C-terminus, or by 
splitting one or both proteins using multi-step PCR. In between there is a 'reactive' 
peptide with a protease site (a thrombin or enterokinase) and a binding site for IP 3 , or the 
15 amino acid sequence form IP 3 kinase (an IP 4 binding site). At the C-terminus of the 
GFP, a peptide containing 6 lysine residues may also be added via PCR. The protein is 

15 expressed and fluorescein covalently linked to these lysines by addition of fluorescein 
isothiocyanate. Addition of the luciferin forms pholasin as described in Example 3. The 
change in colour occurs by chemiluminescence resonance energy transfer. Without 
fluorescein the rainbow protein emits blue-green light (508nm), which changes to blue 
(490nm) when the reactive substance binds to the reactive peptide, or when either 

20 thrombin or enterokinase is added. When the 6 amino acid linker is used the colour starts 
as green (530nm), and will then change from green, to blue-green and then blue as the 
particular reactive sequence binds their respective analytes. Use of rhodamine instead of 
fluorescein generates a rainbow protein which changes from red to green to blue. 

25 A preliminary screen is necessary to select the appropriate rainbow proteins and to discard 
any which have lost all activity. 

The other protein linked to apopholasin may be, for example, any one of the following 
linked chemically or genetically: 

30 

1 . Firefly or any benzothiazole luciferase to the N or C terminus gives two colours 
for ATP and oxygen metabolites. 
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2. Any imidazolopyrazine luciferase, including coelenterazine systems - decapod 
shrimp, fish, sqiud, Renilla, anthzoan, Chaetognate, radiolarian, or copepod and 
Vargula systems - ostracod, Porichthys and similar fish, cypridinids and Vargula. 

3. Any tetrapyrrole luciferase such as dino flagellate, euphausiid or stomiatoid fish. 

5 4. Bacterial luciferase and other aldehyde or flavin luciferases, including polychate 
worm. 

5. Any GFP, including wild type, S65T, enhanced GFP, blue GFP, yellow GFP, 
Renilla GFP, Pilocarpus GFP, and Pennatula GFP, any anthozan GFP, or any 
coelenterate GFP. 

10 6. The red fluorescent proteins from stomiatoid fish - Malactosteus, Aristostomias, 
Photostomias. 

7. The phycobiliproteins - phycoerythrin and phycocyanobilin. 

8. The blue fluorescent lumazine protein in the bacterium Photobacterium. 

9. The yellow flavin fluorescent protein in Y Vibrio. 

15 10. Any lysine or argininine or other amino acid side chain where a fluor can be added 
covalently. In which the case the rainbow protein, amy emiot more than two 
colours. For example, rhodamine on a pholasin-linker-GFP chimera will turn from 
red to green to blue. 

20 A preliminary screen may be necessary to select chimeras which have not lost all 
bioluminescent activity. 

The 'reactive' peptide may be a binding site for any analyte, protein or DNA, metabolite, 
substrate vitamin, an enzyme such a protein kinase C or phosphatase, ion channel, ion 

25 pump, antigen, antibody, nucleotide or nucleoside such as ATP, GTP, ADP, AMP, 
adenosine, cAMP, cCMP, cCCP or their deoxy equivalents, and inositol phosphates such 
as IP 3 or IP 4 a lipid such as diacyl glycerol, phosphatidyl inositol bisphosphate, 
phosphate, a cation such as Ca 2+ , K + or Na + , Cu 2+ or Zn 2+ , or anion such as CX , sulphate, 
or gas such as NO, 0 2 or H 2 , or a protein binding site such as calmodulin, kinesin, dynein, 

30 tubulin, or myosin. 



When pholasin is triggered by oxygen metabolites, the Pholas luciferase or peroxidase, 
energy transfer occurs from pholasin oxyluciferin through GFP to fluorescein resulting in 
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a yellow emission. Addition of thrombin for 3h cleaves the GFP-fluorescein from the 

pholasin and the light emission returns to the blue of native pholasin. Addition of IP 3 to 

the full chimera alters the efficiency of energy transfer. As a result there is a change in 

the ratio of light emitted in the yellow to blue. This ratio is directly related and can be 

5 plotted against the concentration or amount of analyte. The light is detected in a dual 

wavelength luminometer or ratiometric imaging camera and the ratio of blue to green 

light measured. 

Alternatively any fluors can be used, and any binding sites with the right characteristics as 
10 shown in these examples will work provided a simple screen is used to select the right 
chimeras. 

EXAMPLE 8: Engineering a BOIP into a "Rainbow Protein" for two analytes 
together 

15 

Apopholasin is linked to firefly luciferase by using cDNAs and PGR, followed by 
expression in insect cells as described in Example 2. Addition of the luciferin as 
described in Example 3 generates the pholasin. In the presence of firefly luciferin (lmM), 
ATP and oxygen metabolites, this chimera emits blue and yellow simultaneously which 
20 can be independently measured by using a dual wavelength luminometer or imaging 
camera. 



EXAMPLE 9: Expression of BOIPs in mammalian cells 

25 Apopholasin, c or genomic, in an expression plasmid with the CMV promoter, is 
transfected into HeLa cells. After incubation for 3 days to allow expression of the 
apopholasin, the luciferin is added to form pholasin. Expression is checked using a 
polyclonal antibody to pholasin raised in rabbits. Addition of oxygen metabolites outside 
the cell allows the permeability of the plasma membrane to oxygen metabolites to be 

30 assessed. As the oxygen metabolites permeate into the cytosol, the light emission 
increases. 
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c or genomic DNA coding for apopholasin is inserted into a plasmid with the cauliflower 
mosaic virus promoter and transformed into Agrobacterium tumificans. These are then 

5 added to a tobacco leaf, seedlings generated, and those expressing apopholasin selected. 
The plants are grown to seed, and seedlings grown from this seed. Addition of luciferin 
forms the pholasin as described in Example 3. Stressing the plant, e.g. with wind, touch, 
cold, or peroxide, or during growth and development or by a hormone, generates light, 
showing the formation of oxygen metabolites within the live plant. A cell-specific 

10 promoter engineered on to the apopholasin cDNA before making the transgenic plant 
enables oxygen metabolites to be detected in specific cells within the whole, living plant. 

EXAMPLE 11: Detection of oxidative damage in vitro 

15 Addition of pholasin to serum or plasma from a rat, mouse or human enables oxygen 
metabolites to be detected and measured on addition of a drug or other substance of 
interest. 

EXAMPLE 12: Detection of ROMs in a heart cells 

20 

Reperfusion has been proposed to lead to oxygen metabolite damage in cardiac myocytes. 
Pholasin allows this to be tested for the first time. Plasmid containing apopholasin cDNA 
and the CMV promoter is transfected into isolated cardiac myocytes in culture. 
Expression occurs within 1-3 days, and pholasin is formed by addition of the luciferin as 
25 described in Example 3. Subjecting the cells to hypoxia followed by readmission of 
normal oxygen leads to light emission, showing that oxygen metabolites have been 
generated inside the cells. By using an imaging camera, the digital or analogue nature of 
this can be assessed as the number of cells emitting light can be visualised and counted. 



WO 00/28025 26 PCT/GB99/03654 

EXAMPLE 13: Detecti n of ROMs in the nucleus and endoplasmic reticulum (ER) 

Plasmid-containing apopholasin cDNA with either nucleoplasmin DNA or calreticulin 
DNA (with or without KDEL on the C-terminus) linked to the pholasin DNA, to target 

5 the apopholasin to the nucleus or ER respectively, and the CMV promoter for expression, 
is transfected into HeLa cells in culture. Expression occurs within 1-3 days, and pholasin 
is formed by addition of the luciferin as described in Example 3. Addition of oxygen 
metabolites outside the cells, or hypoxic/oxygen shock generates light measured in a 
luminometer, showing how fast oxygen metabolites penetrate into these organelles. By 

10 imaging with a photon counting imaging camera, the number of cells permeable to 
oxygen metabolites can be counted. Location of the pholasin can be assessed by imaging 
live cells, or by using immunofluorescence with the pholasin antibody on partially-fixed 
cells or GFP-pholasin in live cells. Using a rainbow protein, two or more analytes can be 
detected together. 

15 

EXAMPLE 14: Use of pholasin as a protein label 

Pholasin can be used as a label in homogeneous or heterogeneous immunoassay. 
Apopholasin is first covalently linked to an antibody to HIV, and pholasin formed by 
20 addition of luciferin as described in Example 3. The antibody is then used in a standard 
chemiluminometric immunoassay format. Addition of HIV antigen leads to an increase 
in antibody binding and an increase in light emission dependent on the amount of HIV 
added. The amount of HIV in a blood sample can be assessed by relating the pholasin 
light emission in the sample to the standard curve. 

25 

EXAMPLE 15: Pholasin as a DNA label 

Apopholasin is covalently linked to an oligonucleotide probe for detecting the presence of 
the cystic fibrosis gene. Addition of the probe to DNA in a standard Southern blot allows 
30 the probe to bind when the gene is present. Addition of luciferin as described in Example 
3 allows the pholasin to form. Addition of hypochlorite (lOmM) in barbitone buffer pH 9 
causes the pholasin to flash and the gene can be visualised by the photon counting 
imaging camera. 
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1 . An isolated, purified or recombinant nucleic acid sequence comprising: 

(a) a sequence that encodes the apophotoprotein of pholasin (alternatively, 
5 'apopholasin'); 

(b) sequence substantially homologous to or that hybridises to sequence (a) under 
stringent conditions; or 

(c) a sequence substantially homologous to or that hybridises under stringent 
conditions to the sequence (a) or (b) but for the degeneracy of the genetic code; or 

10 (d) an oligonucleotide specific for any of the sequences (a), (b) or (c). 

2. A sequence according to claim 1, wherein the sequence that encodes for 
apopholasin is as shown in Figure 4B. 

3. A sequence according to claim 1, wherein the sequence that encodes for 
apopholasin is as shown in any one of Figuresl, 2, 3, 4A, 6 or 9. 

15 4. A sequence according to any preceding claim, wherein the apopholasin is non- 
glycosylated. 

5. A sequence according to any preceding claim, wherein the apopholasin is 
glycosylated. 

6. An isolated, purified or recombinant construct incorporating a sequence encoding 
20 apopholasin protein according to any preceding claim. 

7. An isolated, purified or recombinant construct incorporating a sequence encoding 
an apophotoprotein whose expression in a substrate, in association with a luciferin 
therefor, signals the presence of oxygen or an oxygen metabolite in the substrate. 

8. An isolated, purified or recombinant construct incorporating a sequence encoding 
25 an apophotoprotein whose expression in a substrate, in association with a luciferin 

therefor, signals the presence of oxygen or an oxygen metabolite in the absence of 
a corresponding luciferase in the substrate. 

9. A recombinant construct according to any one of claims I to 8, wherein the 
nucleic acid sequence is linked operably with nucleotides enabling expression and 

30 secretion of the apopholasin in a cellular host. 

10. DNA or RNA according to any of claims 1 to 9. 

11. An isolated, purified or recombinant polypeptide comprising apophotoprotein of 
pholasin (apopholasin) or a mutant or variant thereof having substantially the 
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EXAMPLE 16; Pholasin in a two hybrid system 

Protein-protein interaction can be detected by engineering apopholasin to one half of a 
5 two hybrid system and GFP to the other. Binding will allow the yeast to grow. 

EXAMPLE 17: Pholasin in genetic entertainment 

Pholasin is able to chemiluminesce at a wide range of pH (3-10), including acid pH such 
10 as 3-4. Thus it can be added to drinks such as beer, cola, soft drinks, and spirits to make 
them glow. It can also make food glow by adding to them to the ingredients of cakes, 
icing, popcorn; by painting the pholasin or apopholasin on to the food, or by making it 
genetically in the source of the food. It can be used in a wide range of toys and other 
entertaining devices including squirt guns, greeting cards, pens. 

15 

The rainbow proteins can also be used as an alternative to pholasin alone, resulting in a 
rainbow of colours and colour changes. 

EXAMPLE 18: Pholasin in trangenic animals 

20 

Transgenic animals such as nematodes, mice or plants can be generated from apopholasin 
cDNA by standard techniques. Injecting the luciferin or incubating whole plant in it forms 
the active pholasin. Oxygen or its metabolites can then be detected, measured and imaged, 
in an intact organ, or from the whole organism. It can also be used in humans, in DNA 
25 therapy or diagnosis. 

EXAMPLE 19: Apoprotein from the luminous squid Ommastrophes 

The use of apoprotein from the luminous squid Ommastrophes is substituted for 
30 apopholasin, and the methods of Examples 1 to 18, above, are carried out. 

EXAMPLE 20: Apoprotein from the mollusc Rocellaria 
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The use of the apoprotein from the mollusc Rocellaria is substituted for apopholasin, and 
the methods of Examples 1 to 18, above, are carried out. 

EXAMPLE 21: Earthworm luciferase 

5 

The use of earthworm luciferase as a BOIP is substituted for apopholasin, and the 
methods of Examples 1 to 18, above, are carried out. 

10 EXAMPLE 22 



Genomic DNA from Pholas, Rocellaria, Ommastrophes, or earthworm is substituted for 
the recombinant protein in Examples 1 to 18, above, the methods of which are carried out 
in an analogous manner. 
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same activity as apopholasin. 

12. An isolated, purified or recombinant polypeptide according to claim 11 
comprising the amino acid sequence of Figure 4 or Figure 5, 

13. The apopholasin according to claim 11 or claim 12 when expressed by 
5 recombinant DNA or RNA according to claim 10. 

14. The apopholasin according to claim 13, which is non-glycosylated. 

15. A cell, plasmid, virus or live organism that has been genetically engineered to 
produce an apoprotein, said cell, plasmid, virus or live organism having 
incorporated expressibly therein a sequence according to any one of claims 1 to 

10 10. 

16. A vector comprising a sequence according to any one of claims 1 to 10. 

17. A host cell transformed or transfected with a vector according to claim 16. 

18. A BOIP, as defined herein, comprising an apophotoprotein according to any one 
of claims 1 1 to 14 in association with a luciferin. 

15 19. A BOIP according to claim 18, wherein the luciferin is derived from Pholas 
dactylus. 

20. A method for the preparation of a BOIP, as defined herein, which method 
comprises bringing an apophotoprotein, such as recombinant apopholasin, into 
association with a luciferin therefor, such as a luciferin derived from Pholas 

20 dactylus. 

21. A BOIP, apophotoprotein thereof, or a nucleic acid sequence encoding either of 
these, which comprises a sequence according to any one of Figures 2 to 6 or 9 that 
has been chemically or genetically modified. 

22. A method for the detection and/or measurement of oxygen or one of its 
25 metabolites extracellularly, which method comprises providing a BOIP, such as 

native or chemically- or genetically- modified BOIP or a 'rainbow protein' based 
on such a BOIP, extracellularly and thereafter detecting and/or quantifying light 
emission therefrom and/or changes in colour, intensity and/or polarisation of 
emission(s), wherein the apophotoprotein comprises recombinant apopholasin. 

30 

23. A method for the detection and/or measurement of oxygen or one of its 
metabolites in live cells (intracellularly), which method comprises providing a 
BOIP, such as native or chemically- or genetically- modified BOIP or a 'rainbow 
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protein* based on such a BOIP, intracellularly and thereafter detecting and/or 
quantifying light emission therefrom and/or changes in colour, intensity and/or 
polarisation of emission(s) therefrom. 

24. A method according to claim 22 or 23, wherein said BOIP includes a signal 
5 peptide, targetting it to a pre-determined extra- or intra- cellular site. 

25. A method according to claim 22 or claim 23, comprising incubating a test sample 
with a cell according to claim 15 or with a membrane preparation derived 
therefrom. 

26. A method according to any one of claims 22 to 24, wherein light emission takes 
1 0 place in the absence of a luciferase. 

27. The use of a sequence or a protein according to any one of claims 1 to 21 in the 
detection, diagnosis or measurement of oxygen or a metabolite thereof. 

28. A diagnostic kit incorporating a sequence or protein according to any one of 
claims 1 to 2 1 . 

15 29- A process for obtaining a substantially homologous source of apopholasin, which 
comprises culturing cells having incorporated expressibly therein a polynucleotide 
encoding apopholasin as defined in any one of claims 1 to 10, and thereafter 
recovering the cultured cells. 
30. A method, use or kit according to any one of claims 20 to 27, substantially as 

20 hereinbefore described with particular reference to the Examples. 
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Clone 40: 

GAATTCGGCA CGAGTC GGAAAAGAAC AAAATGGCTTGT AT CGTTTT C GTT 
GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 
ATGCGCGATGAATTGGACACAAGCTAATGAATATGTGTTCAACGTGGACT 
GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAA 
GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 
GAAAAACAAT CTTCTT C GAGGTT GGTGGAGAAATTGCC CGGCTAGTT G AC 
TACAGAC CACAGGAAGAC GGAACTGAGAAAACTTTTACAAGAAAATTCTC 
TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
ACGCTG ATGAT AAAT GC ATCG AAGGC AC AATT GT GGT GAC AGT C AGGGT G 
TCCCTAT AT G ACGAAGAT AACAATGGT GT AAT GGATGAAGGT AAGGTGAT 
TCCATCTG AG AC AATCGAGG AT GAT AT C AAGG ACTGT GGGCT CTT AGACC 
AAGATGTTGAACTCGATTATACGTGGACTCAAAACGAGTGTGATCTACCA 
GAC ACAGT AG AC GAGGCTG AAG ACAC ACCGT C AG AAACTGGAGAATT CTT 
CT GGTAGAT CTATC AGA CTACTTTTAT CAGCAGGACAACTG GT CGTT ACC 
AGACAC CT AT AACGTGT C CT CAT C AAT AAT GTGT AAAAC AG AAATAAT CG 
ATAGAATATTGAAAATAAAATGTTAATAAACACTGGTTGAAATATGAAAA 
AAAAAAAAAAAAAA C TCGAG 

Clone 3: 

GAATTCGGCACGAGGGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT 
GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 
ATGC GC GATG AATT GGAC AC AAGCT AATGAAT AT GT GTTC AAC GTG GACT 
GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAG 
GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
AAAAGC C GTCT GG ATT AACT G G AGT AAC GAC AC G C AGT CAT GT GT AAC AA 
GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 
TACAGAC CAC AGGAAG ACG G AACT G AGAAAACTTT T ACAAG AAAATT CT C 
TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
ACGCT GAT GAT AAAT GC AT CGAAGGC ACAATTGT GGTG ACAGTC AGGGTG 
TCC CT AT AT GACG AAGAT AACAAT GGTGT AATGG ATG AAGGT AAGGTTAT 
TC C AT CT GAG AC AATCGAGGATG AT ATCAAGG ACTGTGGGCT CTT AG AC C 
AAGATGTT G AACTCGATT AT ACGT GGACT C AAAAC G AGTGT G AT CT ACC A 
GACACAGTAGACGAGGCT GAAGACAC AC CG TCAGAAACTGGAGAATTCTT 
CTGGTAGATCTATCAGACCACTTTTATCAGCAGGACAACTGGTCGTTACC 
AGACAC CTAT AACGT GTC CT CAT CAAT AAT GTGT AAAAC AG AAAT AAT C G 
ATAGAATATTGAAAATAA 



Clone 5: 

GTCGGAAAAGAACAAAATGGCTTGT ATCGTTTT CGTTGCT CTT GTCGCT CTAT GCTTAATGCAACCGGG 
TT CC GGTGAG G AAGT AC AATGC G C GAT GAATT GG ACAC AAGCT AATGAAT AT GT GTTC AACGTGGACTG 
GATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAGGATCGGGCTTTGGGGCTGTG 
TCGGATTGAACGGGCCGGCCCAGGTACCACAAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTC 
AT GTGT AAC AAG AAAAACAATCTT CTT C GAGGTTGGT GG AGAAATTGC CCGG CT AGTT G ACT ACAGACC 
ACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTCTAGCAAAATGCCAGGCACTTACATGCT 
TATGGACGTGTGCGCTACAAGGGACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAG 
GGTGTCCCTAT ATG ACGAAGAT AACAAT GGT GTAAT GGATG AAG GT AAG GTTAT T C C ATCT G AGACAAT 
C G AGG AT GAT AT C AAGG ACTGT GGGCT CTT AGACC AAG AT GTT G AACT CG ATT AT ACGTGG ACT CAAAA 
CG AGTGTGAT CT ACC AG AC AC AGT AG ACG AGGCT GAAGACAC AC CGT C AGAAACT GGAGAATT CTTCT G 
GTAGATCTATCAGACCACTTTTATCAGCAGGACAACTGGTCGTTACCAGACACCTATAACGTGTCCTCA 
TCAATAATGTGTAAAACAGAAATAATCGATAGAATATTGAAAATAAAATGTTAATAGACACTGGTTGAA 
AAAAAAAAAAAAAAAAAA CTCGAG 
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clone 40 GAATrCGGCACGAGTCGGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT 

clone 3 GAATTCGGCACGAG — GGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT 

clone 5 GTCGGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT 

clone 40 GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 

clone 3 GCTCTTGTCGCTCTATGCTTAATGCAAGCGGGTTCCGGTGAGGAAGTACA 

clone 5 GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 



clone 40 ATGCGCGATGAATTGGACACAAGCTAATGAATATGTGTTCAACGTGGACT 
clone 3 ATGCGCGATGAATTGGACACAAGCTAATGAATATGTGTTCAACGTGGACT 
clone 5 ATGCGCGATGAATTGGACACAAGCTAATGAATATGTGTTCAACGTGGACT 



clone 40 GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAA 
clone 3 GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAG 
clone 5 GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAG 



clone 40 GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
clone 3 GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
clone 5 GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 



clone 40 AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 
clone 3 AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 
clone 5 AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 



clone 40 GAAAAAC AAT CTTCTT CGAGGTTGGT GGAGAAATT GC CCGG CT AGTTGAC 

clone 3 . GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 
clone 5 GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 



clone 40 TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 
clone 3 TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 
clone 5 TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 



clone 40 
clone 3 
clone 5 



TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 

TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 

TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
***+*********+*+********+****+****************+*** 



clone 40 
clone 3 
clone 5 



ACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTG 
ACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTG 
ACGCT GATG AT AAAT GC ATC GAAGG C ACAATT GTGGT GAC AGTCAGGGT G 



clone 40 
clone 3 
clone 5 



TCCCTATATGACGAAGATAACAATGGTGTAATGGATGAAGGTAAGGTGAT 
TCCCTATATGACGAAGATAACAATGGTGTAATGGATGAAGGTAAGGTTAT 
TCCCTATATGACGAAGATAACAATGGTGTAATGGATGAAGGTAAGGTTAT 



clone 4 0 TCCATCTGAGACAATCGAGGATGATATCAAGGACTGTGGGCTCTTAGACC 
clone 3 T CC AT CTGAG AC AATC G AGGAT GAT AT CAAGGACTGTGGG CT CTT AGAC C 

clone 5 TCC AT CTGAGACAATCG AGGAT GAT AT C AAGGACTGTGGGCTCTT AGAC C 



Fig. 2 (Part 1 of 2) 
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clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



AAG AT GTTG AACT CGATT AT AC GT GGACT C AAAAC GAGTGTG AT CT AC CA 
AAG ATGTTG AACT CGATT AT ACGT G GACT C AAAACGAGT GT GAT CT ACC A 
AAGATGTT G AACT CGATT AT ACGTG G ACT CAAAAC GAGTGTGATCT ACC A 



GACACAGTAGACGAGGCTGAAGACACACCGTCAGAAACTGGAGAATTCTT 
GACACAGTAGACGAGGCTGAAGACACACCGTCAGAAACTGGAGAATTCTT 
G AC AC AGT AGACG AGGCTGAAG AC AC AC CGT CAG AAACT G G AGAATTCTT 



CTGGTAGAT CT AT C AG ACT ACTTTT ATC AGC AG G AC AACT GGTC GTT AC C 
CT GGTAGAT CT AT C AGACC ACTTTT ATC AGCAGG AC AACT GGT CGTT AC C 
CTGGTAGAT CT AT CAGACC ACTTTT AT C AGC AGGAC AACTGGT CGTT AC C 



AGAC AC CT AT AACGTGT C CT C ATCAAT AATGT GT AAAACAG AAAT AAT C G 
AG ACAC CT AT AACGTGT CCT C AT C AAT AAT GTGT AAAAC AG AAATAATCG 
AGACAC CT AT AACGT GT CCTC ATC AAT AATGT GT AAAAC AG AAAT AATCG 

ATAG AATATTGAAAATAAAAT GTT AAT AAACAC TGGTT GAAAT ATGAAAA 

ATAGAATATTGAAAATAA 

ATAG AAT AT T GAAAAT AAAAT GTT AAT AG AC ACT GGTT GAAA AAA 



AAAAAAAAAAAAAACTCGAG 



AAAAAAAAAAAAAACrCGAG 
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GAA TTCGGCACGACT CGGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT GCT CTT G 

8S 

T CGCTCTAT GCTT AATGCAACC GGGTT C CGGT GAGGAAGT ACAAT GC GCGATGAATT 

GGACACAAGCTAAT GAATAT GT GT TCAAC GT GGACTGGAT GAC CATTTTCATCTACG 

ACTATGGCGCT CAAGAGCAACT GTACGAAGATCGGGCTTTGGGGCTGTGTCGGATTG 

3A 

AACGGGCCGGCCCAGGTACCACAAAAGCCGTCTGGATTAACTGGAGTAACGACACGC 

AGT CATGT GT AACAAGAAAAACAATCTT CTT CGAGGTT GGTGGAGAAATTGCCCGGC 

4S 

TAGTTGACT ACAGACCACAGGAAGACGGAACT GAGAAAACTTTTACAAGAAAATT CT 

CTAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGGACGCTG 

AT GATAAATGCATCGAAGGCAGAA.TTGT GGT GACAGTCAGGGTGT CCCTATAT GACG 
6A 

AAGATAACAATGGTGTAATGGATGAAGGTAAGGTGTTCCATCTGAGACAATCGAGGA 

TGATAT CAAGGACTGTGGGCTCTTAGAC C AAGAT GTT GAACTCGATTAT AC GT GGAC 
7S 

TCAAAACGAGTGTGATCTACCAGACACAGTAGACGAGGCTGAAGACACACCGTCAGA 

AACT GGAGAATT CTT CT GGT AGAT CTAT CAGACTACTTT TATCAGCAGGACAACT GG 

TCGT TACCAGACACCTAT AACGT GT CCT C ATCAAT AAT GT GT AAAACAGAAATAATC 

GAT AGAAT AT T GAAAAT AAAAT GTTAATAAACACTGGTTGAAATATGAAAAAAAAAA 

5A 

AAAAAAAAC TCGAG 



Fig. 3 
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Untranslated region 

GAR 2TCGG C AC GAGTC GG AAAAGAAC AAA 

Translated region 

ATG GCT TGT ATC GTT TTC GTT GCT CTT GTC GCT CTA TGC TTA ATG 
MACIVFVALVALCLM 

CAA CCG GGT TCC GGT GAG GAA GTA CAA TGC GCG ATG AAT TGG ACA 
QP GSG EEVQCAMNWT 

CAA GCT AAT GAA TAT GTG TTC AAC GTG GAC TGG ATG ACC ATT TTC 
QANEYVFNVDWMTI F 

ATC TAC GAC TAT GGC GCT CAA GAG CAA CTG TAC GAA GAT CGG GCT 
IYDYGAQEQLYEDRA 

TTG GGG CTG TGT CGG ATT GAA CGG GCC GGC CCA GGT ACC ACA AAA 
LGLCRIERAGPGTTK 

GCC GTC TGG ATT AAC TGG AGT AAC GAC ACG CAG TCA TGT GTA ACA 
AVWINWSN DTQS CVT 

AGA AAA ACA ATC TTC TTC GAG GTT GGT GGA GAA ATT GCC CGG CTA 
RKT I FFEVG GEI A-RL 

GTT GAC TAC AGA CCA CAG GAA GAC GGA ACT GAG AAA ACT TTT ACA 
VDYRPQEDGTEKTFT 

AGA AAA TTC TCT-AGC AAA ATG CCA GGC ACT TAC ATG CTT ATG GAC 
RKFSSKMPGTYMLMD 

GTG TGC GCT ACA AGG GAC GCT GAT GAT AAA TGC ATC GAA GGC ACA 
VCATRDADDKCI EGT 

ATT GTG GTG ACA GTC AGG GTG TCC CTA TAT GAC GAA GAT AAC AAT 
IVVTVRVSLY DE DNN 

GGT GTA ATG GAT GAA GGT AAG GTG ATT CCA TCT GAG ACA ATC GAG 
GVMDEGKVI PSETIE 

GAT GAT ATC AAG GAC TGT GGG CTC TTA GAC CAA GAT GTT GAA CTC 
DDIKDCGLLDQDVEL 

GAT TAT ACG TGG ACT CAA AAC GAG TGT GAT CTA CCA GAC ACA GTA 
DYTWTQNEC DLP D T V 

GAC GAG GCT GAA GAC ACA CCG TCA GAA ACT GGA GAA TTC TTC TGG 
DEAEDTPSETGE FFW 

TAG ATC TAT CAG ACT ACT TTT ATC AGC AGG ACA ACT GGT CGT TAC 



CAG ACA CCT ATA ACG TGT CCT CAT CAA TAA 
* = stop for translation 



45 
90 
135 
180 
225 
270 
315 
360 
405 
450 
495 
540 
585 
630 
675 
720 
750 
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EcoR I 

GAA TTCGGCACGAGTCGGAAAAGAACAAA 



ATG 


GCT 


TGT 


ATC 


GTT 


TTC 


GTT 


GCT 


CTT 


GTC 


GCT 


CTA 


TGC 


TTA 


ATG 


CAA 


CCG 


GGT 


TCC 


GGT 


GAG 


GAA 


GTA 


CAA 


TGC 


GCG 


ATG 


AAT 


TGG 


ACA 


CAA 


GCT 


AAT 


GAA 


TAT 


GTG 


TTC 


AAC 


GTG 


GAC 


TGG 


ATG 


ACC 


ATT 


TTC 


ATC 


TAC 


GAC 


TAT 


GGC 


GCT 


CAA 


GAG 


CAA 


CTG 


TAC 


GAA 


GAT 


CGG 


GCT 


TTG 


GGG 


CTG 


TGT 


CGG 


ATT 


GAA 


CGG 


GCC 


GGC 


CCA 


GGT 


ACC 


ACA 


AAA 


GCC 


GTC 


TGG 


ATT 


AAC 


TGG 


AGT 


AAC 


GAC 


ACG 


CAG 


TCA 


TGT 


GTA 


ACA 


AGA 


AAA 


ACA 


ATC 


TTC 


TTC 


GAG 


GTT 


GGT 


GGA 


GAA 


ATT 


GCC 


CGG 


CTA 


GTT 


GAC 


TAC 


AGA 

nun 


CCA 


CAG 


GAA 


GAC 


GGA 


ACT 


GAG 


AAA 


ACT 


TTT 


ACA 


AGA 


AAA 


TTC 


TCT 


AGC 


AAA 


ATG 


CCA 


GGC 


ACT 


TAC 


ATG 


CTT 


ATG 


GAC 


GTG 


TGC 


GCT 


ACA 


AGG 


GAC 


GCT 


GAT 


GAT 


AAA 


TGC 


ATC 


GAA 


GGC 


ACA 


ATT 


GTG 


GTG 


ACA 


GTC 


AGG 


GTG 


TCC 


CTA 


TAT 


GAC 


GAA 


GAT 


AAC 


AAT 


GGT 


GTA 


ATG 


GAT 


GAA 


GGT 


AAG 


GTG 


ATT 


CCA 


TCT 


GAG 


ACA ATC 


GAG 


GAT 


GAT 


ATC 


AAG 


GAC 


TGT 


GGG 


CTC 


TTA 


GAC 


CAA 


GAT 


GTT 


GAA 


CTC 


GAT 


TAT 


ACG 


TGG 


ACT 


CAA 


AAC 


GAG 


TGT 


GAT 


CTA 


CCA 


GAC 


ACA 


GTA 


GAC 


GAG 


GCT 


GAA 


GAC 


ACA 


CCG 


TCA 


GAA 


ACT 


GGA 


GAA 


TTC 


TTC 


TGG 


TAG 







ATCTATCAGACTACTTTTATCAGCAGGACAACTGGTCGTTACCAGAC 
ACCTATAACGTGTCCTCATCAATAATGTGTAAAACAGAAATAATCGA 
TAGAATATTGAAAATAAAATGT TAATAAACACT GGTT GAAATATGAA 
AAAAAAAAAAAAAAAAC TCGAG 



Xho I 

Fig. 4B 
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EEVQCAMNWTQANEYVFNVDWMTIFIYDYGAQEQLYEDRALGLCRIERAGPGTTKAV 
WINWSNDTQSCVTRKTIFFEVGGEIARLVDYRPQEDGTEKTFTRKFSSKMPGTYMLM 
DVCATRDADDKCIEGTIWTVRVSLYDEDNNGVMDEGKVIPSETIEDDIKDCGLLDQ 
DVELDYTWTQNECDLPDTVDEAEDTPSETGEFFW 

Fig. 5 A 



MACIVFVALVALCLMQPGSGEEVQCAMNWTQANEYVFNVDWMTIFIYDYGAQEQLYE 
DRALGLCRIERAGPGTTKAVWINWSNDTQSCVTRKTIFFEVGGEIARLVDYRPQEDG 
TEKTETRKFSSKMPGTYMLMDVCATRDADDKCIEGTIWTVRVSLYDEDNNGVMDEG 
KVIPSETIEDDIKDCGLLDQDVELDYTWTQNECDLPDTVDEAEDTPSETGEFFW 



Fig. 5B 
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clone 40 

BioXAct 

zTth 



GAATrCGGCACGAGT CGGAAAAG AAC AAAATGGCTTGT ATCG TTTT CGTT 

TGGCTTGTATCGTTTTCGTT 



clone 4 0 

BioXAct 

zTth 



GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 
GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 
T ATGCTTAAT GCAACCGGGTT CCGGTGAGG AAGTACA 



clone 4 0 

BioXAct 

zTth 



ATGCG C GAT GAATTGGAC ACAAGCT AAT G AAT ATGT GTT CAAC GT GGACT 
ATGCGCGATGAATTGGACACAAGCTAATGAATATGTGTTCAACGTGGACT 
ATGCG CGATGAATTGGACACAAGCTAATGAATATGTGTTCAACGT GGACT 



clone 40 

BioXAct 

zTth 



GGAT G AC CATTTT C AT CTAC G ACTAT GGCGCT C AAGAGCAACT GT ACGAA 
GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAA 
G GAT G ACCATTTT CAT CT AC G ACT AT GGCGCT CAAGAGCAACTGT ACGAA 



clone 40 

BioXAct 

zTth 



GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 



clone 40 

BioXAct 

zTth 



AAAAG CC GT CTGG ATT AACTGGAGT AACGAC AC GCAGTC AT GT GT AACAA 
AAAAGCCGT CTGG ATT AACTG GAGT AACG ACAC GCAGTCAT GTGT AACAA 
AAAAG CCGT CTGG ATT AACTG GAGT AACGAC ACG CAGTC AT GT GT AACAA 



clone 40 

BioXAct 

zTth 



G AAAAACAAT CTT CTT CGAGGTT GGTGGAGAAATTG C CCGGCT AGTTGAC 
GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 
GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 



clone 4 0 

BioXAct 

zTth 



TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 
T AC AG ACCAC AGG AAG AC GG AACTG AGAAAACT TTTAC AAGAAAATTCT C 
TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 



clone 40 

BioXAct 

zTth 



TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 



clone 40 

BioXAct 

rTth 



ACGCTG AT G ATAAATGC AT CGAAGGC AC AATTGTGGTG ACAGTCAGGGTG 
ACGCTG ATGAT AAATGC ATCG AAGG CACAATT GTGGT GACAGT CAGGGTG 
ACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTG 



clone 40 

BioXAct 

zTth 



T CCCTAT AT GACGAAGATAAC AATGGTGT AATGGATGAAGGTAAGGTGAT 
T CCCT AT AT GACG AAGAT AAC AATGGTGT AATG GAT G AAGGT AAG GTG AT 
TCCCTATATGACGAAGATAACAATGGTGT AATGGATGAAGGTAAGGTGAT 



clone 40 
BioXAct 



TCCATCT GAG AC AAT C GAG GAT GAT ATC AAGGACTGT GGGCTCTT AGACC 
TCCATCT GAG AC AAT C G AGGAT GAT ATC AAGGACTGT GGGCTCTT AGACC 



Fig. 6 (Parti of 2) 
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rTth 



clone 40 

BioXAct 

rTth 



clone 40 

BioXAct 

rTth 



clone 40 

BioXAct 

rTth 



clone 40 

BioXAct 

rTth 



clone 40 ' 

BioXAct 

rTth 



clone 40 

BioXAct 

rTth 



TCCATCTGAGACAATCGAGGATGATATCAAGGACTGTGGGCTCTTAGACC 



AAGAT GTTGAACT CGATT AT ACGT GGACTCAAAACGAGT GT GATCT ACCA 
AAGAT GTTGAACT CGAT T AT ACGT GGACT CAAAACGAGTGT GATCTACCA 
AAGAT GTTGAACT C GATT AT AC GT GGACT C AAAACGAGT GT GATCT ACCA 



G ACAC AGT AG ACG AGGCT G AAG ACAC AC C GT C AGAAACT GG AG AATTCTT 
G AC AC AGTAG AC G AGG CTGAAGAC AC AC C GT C AGAAACT GGAGAATTCTT 
GACACAGTAGACGAGGCTGAAGACACACCGTCAGAAACTGGAGAATTCTT 



CTGGTAGATCTATCAGACTACTTTTATCAGCAGGACAACTGGTCGTTACC 
CTGGTAGAT CT AT C AGACT AC TTTT AT C AGCAGGAC AACT GGTCGTT AC C 
CTGGTANATCTATCAGACTACTTTTATCAGCAGGACAACTGGTCGTTACC 



AGACACCTATAAC GTGT C CT CATCAATAATGTGTAAAACAGAAATAATC G 

AGAC ACCT AT AACGTGTCCT C ATCAAT AATGTGT AAAACAGAAAT AATC G 

AGACAC CT AT AAC GTGT CCTC AT C AAT AATGTGT AAAAC 
**■*■**■ A-***************-**.*************** * 

ATAGAAT ATTGAAAAT AAAAT GTT AATAAAC ACT GGTTGAAAT AT GAAAA 
AT AGAAT ATT G AAAATAAAAT GTT AAT AAACACT GGTT GAAAT AT GAA 



AAAAAAAAAAAAAA C TCGAG 
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Oligo 1 



ACI ATH TTY TTY CAR GT 



Oligo 2A 



CAR GAR GAR GGN ACI GA 



Fig. 7 A 



TCI GTN CCY TCY TCY TG 



Oligo N 



TTY AAY GTI GAY TGG ATG 



M=A/C 



R=A/G 
K=G/T 



W=A/T 



S=G/C 



Y=C/T 



V-A/C/G H=A/C/T D—A/G/T B=C/G/T N=A/C/G/T 
I=inosine 



Oligo 3A 

ACA CAG CCC CAA AGC CCG AT 
Oligo 4S 

TTG CCC GGC TAG TTG ACT AC 
Oligo 5A 

CAT ATT TCA ACC AGT GTT TAT TAA 
Oligo 6A 

CAA TTG TGC CTT CGA TGC A 



Oligo 7S 




7B 



GGA CTG TGG 



GCT CTT AG 



Oligo 8S 



ATG GCT TGT 



ATC GTT TTC GT 



Oligo T7 
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Oligo ExS 

CCA CAC GGA TCC TGA GGA AGT ACA ATG 
Oligo ExA 

CCA CAC GGA TCC TTA TTG ATG AGG ACA 
Oligo Bacl 

CTT GTT TTT ATG GTC GTC TAC ATT TCT TAC ATC TAT GCG GAG 
GAA GTA CAA TG 

Oligo C9 12 

CCA CAC AGA TCT AGA ATG AAA TTC TTA GTC AAC GTT GCC CTT 
GTT TTT ATG GTC 

Oligo BV5 

TTT ACT GTT' TTC GTA ACA GTT TTG 
Oligo BV3 

CAA CAA CGC ACA GAA TCT AG 
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AccI 




630 






AflUX 




405 




734 


Alul 95 










AlwNI 


659 








Asp 718 




215 






Asul 




204 


209 




BanI 




215 






Banll 




564 






Bcnl 




51 


310 




BglXI 




678 






Bspl 286 




564 






BstNI 




213 




384 


BstUI 


77 








CfrlOl 


206 








Cfrl3l 


204 




209 




Ddel 




345 




528 


Dpnl 




174 




615 


EcoRI 


665 








EcoRII 


211 




382 




EcoRV 


547 








Fold 




136 




518 


Haell 




153 






Haelll 


206 




210 




HgaX 




431 






Hhal 




77 




152 


Hindi 




319 






Hinfl 




520 




598 


HinPlI 


75 




150 




HpaXI 


50 




57 




HphI 


71 




469 




Kpnl 




219 






Mael 




314 




372 


MaeXI 




114 




405 


Mael II 


245 




265 




MboII 


182 




274 




497 


653 




661 




Mnll 


54 




282 




750 










Msel 


41 




237 




Nael 


208 








Neil 




51 




310 


NlalXI 


264 


397 






NlalV 


55 




217 




Nail 




440 






NspHI 


397 








Plel 


592 








Rsal 


69 




167 




Sau3AI 


172 


613 


678 




Sau96I 


204 


209 






ScrFI 


51 


213 


310 


384 


SfaNI 


428 








TaqI 288 


441 


537 


585 




XhoII 


678 









565 
680 



554 



413 



411 

207 310 
529 



593 734 

457 716 

277 347 

531 627 



217 



Fig. 8 
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Nlaiv 
Mnll 
ScrFI 
Neil 
Bcnl 

Msel Hpall Hpall 

I Mill 

ATGGCTTGTATCGTTTTCGTTGCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGT 60 

Hhal 
HphI BstUI 

Rsal HinPlI Alul Maell 

i I II I I 

G AGG AAGT AC AAT GC GC GAT G AAT T GGACACAAGCTAATGAATATGTGTTC AACGTGGAC 120 

Haell 

Hhal Dpnl 
Fokl HinPlI Rsal Sau3AI 

I 111 III 

TGGAT GACC ATT TTC ATC T ACG AC T AT GGC GC T C AAGAGC AACTGTACGAAGAT C GGGCT 180 

Haelll 
Sau96I 
Cfrl3I 
Nael Rsal 
Hpall NlalV 
Haelll BanI 
CfrlOI ScrFI 
Sau96I BstNI 
Cfrl3I EcoRII Kpnl 
MboII Asul Asul Asp718 Msel 

I I I I I I I I I I I I I 

TTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCACAAAAGCCGTCTGGATTAAC 240 

Maelll MboII 
Maelll Nlalll MboII Mnll TaqI 

I II I I I I 

TGGAGTAACGACACGCAGTCATGTGTAACAAGAAAAACAATCTTCTTCGAGGTTGGTGGA 3 00 

Mael 
ScrFI 
Neil 

Hpall MboII 
Bcnl Hindi Ddel 

III li 
GAAATTGCCCGGCTAGTT GACT AC AGAC C AC AGGAAGACGG AAC T GAG AAAACT TTT ACA 3 60 



Fig. 9 (Pari 1 of 2) 
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ScrFI 

BstNI NspHI Maell HinPlI 

Mael EcoRII Nlalll Afllll Hhal 

i II I III 

AGAAAAT TC TCTAGC AAAATGC CAGGCACTT AC AT GC T TAT GGAC GT GT GC GC T AC AAGG 420 

Hgal TaqI 
SfaNI Nsil Maelll HphI 

1 I II I I 

GACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTGTCCCTATAT 480 

Mnll 
Hinfl HphI 
MboII Fokl Ddel TaqI 

! II Hit 

GACGAAGAT AACAATGGT GT AAT GG AT GAAGGTAAGGT GATTCCATCTGAGACAATCGAG 540 

Ddel 

Bspl286 Maell 
EcoRV Fokl Banll TaqI Plel Hinfl 

It II tilt 

GATGAT AT C AAGGAC T GT GGGCTCTT AGACC AAGAT GT T GAACT C GAT TAT AC GT GGAC T 600 

Dpnl AccI 
Sau3AI Mnll MboII AlwNI 

II II II 

CAAAACGAGTGTGATCTACCAGACACAGTAGACGAGGCTGAAGACACACCGTCAGAAACT 660 

Dpnl 
XhoII 
EcoRI Sau3AI 
MboII Bglll Maelll 

tilt I 
GG AGAATT C TTC T GGT AGATCT ATC AGACT AC TT T TAT CAGCAGGACAACTGGTCGTTAC 720 

Maell 

Afllll Mnll 

I f 
CAGACACCTATAACGTGTCCTCATCAATAA 750 



Fig. 9 (Part 2 of 2) 
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FIGURE 10 
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1 . An isolated, purified or recombinant nucleic acid sequence comprising: 

(a) a sequence that encodes the apophotoprotein of pholasin (alternatively, 
5 'apopholasin*); 

(b) a sequence substantially homologous to or that hybridises to sequence (a) under 
stringent conditions; or 

(c) a sequence substantially homologous to or that hybridises under stringent 
conditions to the sequence (a) or (b) but for the degeneracy of the genetic code; or 

1 0 (d) an oligonucleotide specific for any of the sequences (a), (b) or (c). 

2. A sequence according to claim 1,- wherein the sequence that encodes for 
apopholasin is as shown in Figure 4B. 

3. A sequence according to claim U wherein the sequence that encodes for 
apopholasin is as shown in any one of Figuresl, 2, 3, 4A, 6 or 9. 

15 4. A sequence according to any preceding claim, wherein the apopholasin is non- 
glycosylated. 

5. A sequence according to any* preceding claim, wherein the apopholasin is 
glycosylated. 

6. An isolated, purified or recombinant construct incorporating a sequence encoding 
20 apopholasin protein according to any preceding claim. 

7. An isolated^ purified or recombinant construct incorporating a sequence encoding 
an apophotoprotein whose expression in a substrate, in association with a luciferin 
therefor, signals the presence of oxygen or an oxygen metabolite in the substrate. 

8. An isolated, purified or recombinant construct incorporating a sequence encoding 
25 an apophotoprotein whose expression in a substrate, in association witka Jueiferin 

therefor, signals the presence of oxygen or an oxygen metabolite in the absence of 
a corresponding luciferase in the substrate. 

9. A recombinant construct according to any one of claims 1 to 8, wherein the 
nucleic acid sequence is linked operably with nucleotides enabling expression and 

30 secretion of the apopholasin in a cellular host. 

10. DNA or RNA according to any of claims 1 to 9. 

11. An isolated, purified or recombinant polypeptide comprising apophotoprotein of 
pholasin (apopholasin) or a mutant or variant thereof having substantially the 
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same activity as apopholasin. 

12. An isolated, purified or recombinant polypeptide according to claim 11 
comprising the amino acid sequence of Figure 4 or Figure 5. 

13. The apopholasin according to claim 11 or claim 12 when expressed by 
5 recombinant DNA or RNA according to claim 10. 

14. The apopholasin according to claim 13, which is non-glycosylated. 

15. A cell, plasmid, virus or live organism that has been genetically engineered to 
produce an apoprotein, said cell, plasmid, virus or live organism having 
incorporated expressibly therein a sequence according to any one of claims 1 to 

10 10. 

16. A vector comprising a sequence according to any one of claims I to 10. 

17. A host cell transformed or transfected with a vector according to claim 16. 

18. A BOIP, as defined herein, comprising an apophotoprotein according to any one 
of claims 1 1 to 14 in association with a luciferin. 

15 19. A BOIP according to claim 18, wherein the luciferin is derived from Pholas 
dactylus. 

20. A method for the preparation of a BOIP, as defined herein, which method 
comprises bringing an apophotoprotein, such as recombinant apopholasin, into 
association with a luciferin therefor, such as a luciferin derived from Pholas 

20 dactylus. 

21. A BOIP, apophotoprotein thereof, or a nucleic acid sequence encoding either of 
these, which comprises a sequence according to any one of Figures 2 to 6 or 9 that 
has been chemically or genetically modified. 

22. A method for the detection and/or measurement of oxygen or one of its 
25 metabolites extracellularly, which method comprises providing a BOIP, such as 

native or chemically- or genetically- modified BOIP or a 'rainbow protein' based 
on such a BOIP, extracellularly and thereafter detecting and/or quantifying light 
emission therefrom and/or changes in colour, intensity and/or polarisation of 
emissions), wherein the apophotoprotein comprises recombinant apopholasin. 

30 

23. A method for the detection and/or measurement of oxygen or one of its 
metabolites in live cells (intracellularly), which method comprises providing a 
BOIP, such as native or chemically- or genetically- modified BOIP or a 'rainbow 
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protein* based on such a BOIP, intracellularly and thereafter detecting and/or 
quantifying light emission therefrom and/or changes in colour, intensity and/or 
polarisation of emission(s) therefrom. 

24. A method according to claim 22 or 23 , wherein said BOIP includes a signal 
peptide, targetting it to a pre-determined extra- or intra- cellular site. 

25. A method according to claim 22 or claim 23, comprising incubating a test sample 
with a cell according to claim 15 or with a membrane preparation derived 
therefrom. 

26. A method according to any one of claims 22 to 24, wherein light emission takes 
place in the absence of a luciferase. 

27. The use of a sequence or a protein according to any one of claims 1 to 21 in the 
detection, diagnosis or measurement of oxygen or a metabolite thereof. 

28. A diagnostic kit incorporating a sequence or protein according to any one of 
claims 1 to 21. 

29. A process for obtaining a substantially homologous source of apopholasin, which 
comprises culturing cells having incorporated expressibly therein a polynucleotide 
encoding apopholasin as defined in any one of claims 1 to 10, and thereafter 
recovering the cultured cells. 

30. A method, use or kit according to any one of claims 20 to 27, substantially as 
hereinbefore described with particular reference to the Examples. 



1 

IN AND DNA CODING THEREF 



The present invention relates to a protein, capable of bioluminescence, cDNA coding 
therefor and their uses, inter alia, in diagnostics and therapy. In particular, this invention 
relates to the cloning and sequencing of cDNA coding for pholasin from the bivalve 
mollusc Pholas dactylus. 

The term 'bioluminescence' refers to the emission of light resulting from a chemical 
reaction within, or produced by, a living organism. The essential components to the 
chemical reaction are: an organic molecule, usually comprising a luciferin; oxygen or one 
of its metabolites; and an enzyme or luciferase that catalyses the oxidation of the luciferin. 
The chemi luminescent reaction responsible for bioluminescence may be represented as 
follows: 



Luciferin + 0 2 , or 0" 2 , or H 2 0 2 , or OH or OCT or OX" or 3 0 2 (+ luciferase) 
oxyluciferin + light. 



Up to three other substances may also be required to generate light or to generate light of 
the required colour and intensity. These are as follows: 

(a) A cation, such as H + , Ca 2+ , Mg 2+ or a transition metal cation (eg Cu + /Cu 2+ , 
Fe 2+ /Fe 3+ , La 3+ and V 3+ ); 

(b) A co-factor such as NAD(P)H, FMN or ATP; and/or 

(c) A fluor as an energy transfer acceptor. 

Five chemical families of luciferin are known: 

(a) Aldehydes (found in the freshwater limpet Latia, earthworms, and with FMN in 
bacteria); 

(b) Imidazolopyrazines, which are the compounds most commonly responsible for 
bioluminescence in the sea (found in Sarcomastigophora, Cnidaria, Ctenophora, 
Annelida, Chaetognatha, some Arthropoda, some Mollusca and some Chordata); 

(c) Benzothiazoles (found in beetles such as fireflies and glow-worms); 

(d) Linear tetrapyrroles (found in dinoflagellates, euphausiid shrimp and some fish); 



and 

(e) Flavins (found in bacteria, fungi, polychaete worms and some molluscs). 

Chemiluminescent reactions involving these luciferins may produce a glow or a flash with 
an emission of violet, blue, blue-green, green, yellow, orange or red light, or occasionally 
UV or IR light. The light emission may be linearly or circularly polarised. The luciferin 
or its product may also be detected and quantified by fluorescence or phosphorescence. 
As a chemical reaction is directly responsible for the light emission, it does not require 
exposure to UV, visible or IR light. However, some bioluminescent systems, such as that 
in the red organ of the deep sea fish Malacosteus, exhibit a photo-chemi luminescence, 
where light can trigger or enhance the chemiluminescent reaction. [Reference is directed 
to Chemiluminescence: Principles and Applications in Biology and Medicine, A K 
Campbell (1988), Horwood/VCH Chichester, Weinheim.] 

In the case of some bioluminescent proteins, the luciferin is so tightly or covalently bound 
to the protein molecule that it does not diffuse away into the surrounding fluid as a result 
of the chemiluminescent reaction. In this case, the protein-luciferin complex is known as 
a photoprotein; and the protein itself is referred to as an apophotoprotein. Some 
bioluminescent proteins are proteins whose light emission or radiation depends on or may 
be altered by oxygen or one of its metabolites; these bioluminescent proteins are 
hereinafter referred to as 'bioluminescent oxidative indicator proteins' (BOIPs).BOIPS 
may thus be photoproteins or luciferin-luciferase systems. 

BOIPs, therefore, may be used to detect and quantify oxygen or one of its metabolites in 
individual cells, defined compartments of living cells such as the nucleus, whole organs 
and organisms - both animals and plants, including microbes such as viruses and bacteria 
and protozoa - as well as substances of biological interest such as substrates, metabolites, 
vitamins, drugs, intra- and extra-cellular signals, enzymes, antigens, antibodies and 
nucleic acids. Heretofore, it has only been known to employ native BOIPs extracellularly. 

The present invention therefore relates to a method for the detection and/or measurement 
of oxygen or one of its metabolites in live cells (intracellular), which method comprises 
providing a BOIP, such as native or chemically- (or genetically-) modified BOIP or a 
'rainbow protein' based on such a BOIP, intracellularly and thereafter detecting and/or 
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quantifying light emlWrcn therefrom and/or changes in coWw, intensity and/or 
polarisation of emission(s) therefrom. 

Furthermore, it has now been found that, by sequencing the BOIP and identifying the 
5 cDNA encoding therefor, the recombinant BOIP can also be used in such a method, or 
chemically- or genetically-modified recombinant BOIP, or a 'rainbow protein' based on 
such a BOIP. For example, the bivalve mollusc Pholas dactylus has been shown to 
comprise a native photoprotein, which interacts with a luciferase, when they are secreted 
together by the mollusc to produce light when O2 or one of its metabolites is present. 

10 References to the Purification and Properties of Pholas Dactylus Luciferin and Luciferase 
can be found by Michelson in Methods in Enzymology LVII 385-406 (1978). References 
to detection of activation of neutrophils by detection of superoxide anion can be found by 
Roberts in Anal Biochem 160 139-148 (1987) and by Muller et al in J Biolum Chemilum 
3 105-113 (1989). The native photoprotein (known as pholasin) is made up of a 

15 glycosylated apoprotein (34kDa) with a small organic molecule, the luciferin, tightly 
bound to it. This luciferin (whose structure is unknown - Muller and Campbell in J 
Biolum Chemilum 5 25-30 (1990)) can be extracted from the protein moiety - the 
apopholasin - or from the organism by a standard treatment, such as mild acid. The 
pholasin may be collected from live molluscs found in sedimentary rocks at low water 

20 along the south coast of England from Plymouth to Folkestone and also along the French 
channel coast and in the Mediterranean. Further details may be obtained from marine 
fauna and the references cited herein. 

We have surprisingly found that pholasin can generate light even without the presence of 
25 the corresponding luciferase by addition of oxygen metabolites such as 0 2 H 2 0 2 , OCI- 
or other oxyhalide anions, or organic peroxides, and certain organic solvents such as 
dimethyl sulphide (DMSO) or dimethyl formamide (DMF). 

We have now identified the cDNA encoding for the (non-glycosylated) apoprotein of 
30 pholasin, which may also be called 'apopholasin'. Accordingly, the present invention 
therefore further provides an isolated, purified or recombinant nucleic acid sequence 
comprising: 



(a) The apophotoprotein of pholasin (alternatively, 'apopholasin'); 
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(b) A sequence suMWtially homologous to or that hybrid iseWFsequence (a) under 
stringent conditions; or 

(c) A sequence substantially homologous to or that hybridises under stringent 
conditions to the sequence (a) or (b) but for the degeneracy of the genetic code; or 

(d) An oligonucleotide specific for any of the sequences (a), (b) or (c). 



The present invention will now be further described with reference to the accompanying 
Figures, in which: 



Figure 1 (SEQ ID NOS 1-3) shows three different cDNAs encoding apopholasin, referred 
to as clones 40, 3 and 5. Nucleotides in bold type show codons used for initiation and 
termination of translation; 



Figure 2 (SEQ ID NOS 1-3) shows the three sequences of Figure 1 (SEQ ID NOS 1-3) 
aligned to demonstrate the sequence similarity. This figure was generated by Clustal. 
Positions which are indicated with a star are identical in all three clones. The codons for 
the initiation and termination of translation are highlighted in bold; 

Figure 3 (SEQ ID NO: 1) shows the oligonucleotides used for the complete sequencing of 
the positive clones. These were identified from the cDNA library; their positions in clone 
40 are shown. Oligonucleotides are shown in bold type, portions of the flanking sequence 
of the Bluescript plasmid are shown in italic type; 

Figure 4 (SEQ ID NOS 1 and 4) describes the protein sequence described by the DNA 
sequence coding for apopholasin and shows, in Figure 4A, the complete sequence (SEQ 
ID Nos 1 and 4) of the positive clone 40 identified from the Pholas dactylus light organ 
library. The first 20 amino acids at the N-terminus are a signal peptide, and this can either 
be retained or removed when generating the BIOP as described in this invention and, in 
Figure 4B (SEP ID NO: 1) . the cDNA coding for apopholasin with untranslated 5' and 3' 
ends. The untranslated regions are also shown; 

Figure 5 (SEQ ID NOS 5-6) describes the protein sequence for pholasin with (5B) (SEQ 
ID NO: 6) and without (5A) (SEP ID NO: 5) the signal peptide; 
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Figure 6 (SEP ID Nd^ piece of SEP ID NO: 1 and SEP 7^*10: 23) shows the 
sequence for apopholasin genomic DNA, Two gDNA clones were indentifed but no 
introns were found; the Figure shows an alignment of the cDNA from cDNA clone 40 
(SEP ID NO: U and the gDNA amplified by both vTth DNA polymerase XL (SEP ID 
NO: 23) and BioXAct polymerase (piece of SEP ID NO: 1 ) . The sequences of the PCR 
product and the inserts in pGEM T were aligned with the sequence of the cDNA of clone 
40 and were identical to this cDNA; 

Figure 7 (SEP ID NOS 7-22) describes the oligonucleotides used for screening and 
expression. Degenerate oligonucleotides for library screening are shown in Figure 7A; 
(SEP ID NOS 7-10) non-degenerate ones in Figure 7B (SEP ID NOS 7-16) : and 
oligonucleotides used for protein expression are shown in figure 7C (SEP ID NOS 17-22 

If 

Figure 8 lists the main restriction sites in the DNA for engineering pholasin; 

Figure 9 (piece of SEP ID NO: 1) is a schematic representation of Figure 8 mapped to the 

sequence of Figure 4A (translated region), and 

Figure 10 is a time course of hypo-chlorite triggered chemiluminescence. 

Accordingly, the present invention provides recombinant DNA encoding the 
apophotoprotein apopholasin and comprising the nucleotide sequence of the sequence 
disclosed in Figure 4B (SEP ID NO: 1) . Three different cDNAs coding for apopholasin 
have been isolated, having differing non-coding regions, respectively disclosed in Figure 
1 (SEP ID NOS 1-3) . The genomic DNA (gDNA), which contains no introns, has been 
shown (Figure 6) (SEP ID NO: 1. piece of SEP ID NO: L SEP ID NO: 23) to comprise 
the same basic sequence as the cDNA. 

Pholasin is a glycoprotein having 11.1 glusamine, 9.8 fructose, 7.1 mannose and 5.2 
galactose residues. The cDNA for apopholasin has a molecular weight of 23,456 
compared to 34,600 of the pholasin extracted from Pholas. The difference in the 
molecular weights of native versus recombinant apopholasin is due to the glycosylation of 
the native protein and luciferin. The isoelectric point of the translated protein calculated 
by the ISOELECTRIC command of the GCG programme is at 3.84. The native protein 
has a lower isoelectric point (<3.5), probably due to the presence of bound sulphate. 
The three clones (Figure 2) (SEP ID NOS 1-3) isolated from the library encode a unique 
protein (Figures 4 and 5) (SEQ ID NOS 1 and 4-6) . which does not have the same amino 
acid sequence as any known protein in the SwissProt database. The present invention 



therefore not only pro^Ks cDNA and RNA coding for the^Kein, but also the 
recombinant protein per se, with or without glycosylation units. A comparison of 
segments of the pholasin protein sequence with the proteins in the SwissProt database 
identified several proteins with regions having a high sequence similarity to regions of the 
5 cloned protein. These included several proteins which interact with nucleotides (Table 1). 



Table 1 A comparison of sections of the sequence of the cloned protein with sections 

of proteins which interact with nucleotides. 



Protein 


Homologous region 
cloned protein 

homology (+ denotes a conserved amino acid) 
selected protein 


tRNA-splicing endonuclease p 
subunit fDiece of S EO ID NO: 4) 
Saccharomyces cerevisiae 
EC 3.1.27.9 (SEO ID NO: 24) 


SLYDEDNNGVMDEGKV I Pbhi IE 
+L DEDNN + + G ++P E ++E 
NLRDEDNNLLDENGDLLPLESLE 

LDQDVELDYTW 
LD DV DYTW 
LDHDVSKDYTW 


ATP-AMP transphosphorylase 
(oiece of SEO ID NO: 5)Cvprinus 
carpio 

EC 2.7.4.3 (SEO ID NO: 25) 


VMDEGKVI PSETIEDDIKDCGLLDQDVELDY 
+M +G+++P + T + D IKD + DV Y 
IMQKGELVPLDTVLDMIKDAMIAKADVSKGY 


DNA primase ( piece of SEO ID 
NO: 6) 

Synechocystis sp. 

EC 2.7.7. -(SEO ID NO: 26) 


EEVQCAMNWTQANEYVFNVD 
++VQ M ++Q+ + +FN D 
DQVQSLMRFSQSKQI Ir Nr u 


purine permease (piece of SEO ID 
NO: 6) 

Emericella nidulans (SEO ID NO: 
27) 


VQCAMNWTQANEYV 
IMCSVDWTRRNRFI 


DNA repair protein complementing 
XP-A cells homotoaue ("piece of 
SEO ID NO: 5) 

Drosophila melanozaster (SEO ID 
NO: 28) 


PDTVDEAEDTPSET 
PDT DE EDT + T 
PDTYDEEEDTYTHT 


ATP svnthase p chain (piece of 
SEO ID NO: 5) 
Peptococcus niger 
EC 3.6.1 .34 (SEO ID NO: 29) 


DTVDEAEDTPSET 
D +DEA + PSET 
DPI DEAGEVPSET 


DNA polvmerase a (piece of SEO 
ID NO: 6) 
Homo sapiens 


DEDNNGVMDEGKVI PSETIEDDIKD 
D+D G +++G+ I + + EDD D 
DDDGIGYVEDGREI FDDDLEDDALD 
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EC 2.7.7.7 (SEOIDNO^O) 



Similarity was found between the Vargula luciferase and Renilla LBP, but no other 
bioluminescent protein. 



5 Sequence homology between the cloned protein (piece of SEP ID NO: 5) and (a) Vargula 
luciferase (SEQ ID NO: 3 0 (b) Renilla LBP (SEQ ID NO: 321 An area of high 
homology in all three proteins is in bold print. 



10 (a) 148 206 
GTIWTVRVSLYDEDNNGVMDEGPCVIPSETIEDDIKDCGLLD-QDVELDYTWTQNECDL 
V+VSL D + + + T+ DID + .V++ 4- + 

YWNTWDVKVSLRDVESYTEVEKVTIRKQSTWDLIVDGKQVKVGGVDVSIPYSSENTSI 
353 412 

15 (b) 

105 166 
STMPGTYMLMDVCATRDADDKCIEGTIWTVRVSLYDEDNNGVMDEGKVIPSETIEDDIKDC 

+ TR + VR+S+ + N+ K I 

AIKIAKLSAEKAEETRGFLRVADQLGLAPGVRISVEEAAVNATDSLLKMKAEEKAMAVIQSL 
20 41 104 



Three potential glycosylation sites on the protein have the consensus triplet sequence Asn- 
Xaa-Ser/Thr (where Xaa can be any residue except proline). Thr 216 was identified as a 
potential site of O-l inked glycosylation by a neural network which has been trained to 
25 identify this type of glycosylation. The amino acid sequence was also entered into a neural 
network which had been trained to identify eukaryotic signal peptides. This confirmed that 
the most likely cleavage site is between positions 20 and 2 1 (GSG-EE). 



Many families of proteins contain a "signature" sequence of amino acids. The sequence of 
30 the clones did not contain any of these signatures present in the PROSITE database. The 
amino acids from 170 to 185 correspond to the calcium binding consensus sequence 
[DENQST]X[DENQST]X[DENQST]X[DENQST]X[DENQST]XX[DENQST]. Thirteen 
potential phosphorylation sites were discovered that matched the consensus sequences for 
either the kinase phosphorylation site [RK](2)-x-[ST], the protein kinase C phosphorylation 
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site [ST]-x[RK] or the cWm kinase II phosphorylation site [ST]-x(2J^E]. 

Three N-linked glycosylation sites were identified in the translated sequence of the clones A 
neural network has been trained to identify this type of glycosylation which identified Thr 
5 216 as a potential site of O-linked glycosylation. At least one of these sites must be 
glycosylated in the native protein in order to account for the presence of the sugar residues. 
A putative signal peptide region preceded the N terminus of the secreted protein (determined 
by amino acid sequencing and was identified as a signal peptide by a neural network). To 
confirm this result the protein sequence was searched with PSORT for motifs which would 
10 locate the cloned protein in a cellular compartment. The protein sequence did not contain 
any transmembrane regions or N-myristoylation patterns which would indicate the presence 
of a lipid anchor. No targeting or retention sequences were found for the nucleus, 
mitochondria, endoplasmic reticulum or peroxisome. 

15 The fact that the clones had some sequence similarity with proteins that interact with 
nucleotides may suggest that pholasin binds a co-factor as part of the chemi luminescent 
reaction. Beetle luciferases require ATP binding for chemiluminescent activity. There is no 
P-loop binding motif ((A,G)x4GK(S,T) or (A)x{4}GK(T)) (SEP ID NO: 33) in the amino 
acid sequence of these clones. However, not all ATP binding proteins contain this motif. 

20 Neither does the cloned protein contain the GXGXXG phosphate binding consensus 
sequence necessary for the binding of other co-factors such as nicotinamide adenine 
dinucleotide. 

The amino acid and sugar components of pholasin are not able to emit light at the 
25 wavelength of the native protein (490nm). This indicates that there must be a chromophore 
bound to the protein. There are, however, proteins in which the chromophore is composed 
of modified amino acid residues within the polypeptide. The best characterised of these is 
the green fluorescent protein (GFP). This has a chromophore which is a ring formed by the 
autocatalytic cyclisation of the residues Ser-dehydroTyr-GIy. The serine may be mutated to 
30 a threonine, which increases the amplitude of the emission at 488nm. Pholasin had no 
similar amino acid sequence. Putative luciferin binding regions have been identified for two 
bioluminescent chemistries. Aequorin has a putative coelenterazine binding region, which 
is also present in two sections of the Vargula hilgendorfii luciferase. The sequence of the 
cloned protein has no homology to the putative luciferin binding site of aequorin, but the 
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;rase from residue 353 to 411 has som 
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ilarity, as does the 



LBP of Renilla reniformis, which also binds an imidazolopyrazine. This may indicate that 
the chemistry of pholasin bioluminescence involves an imidazolopyrazine luciferin. 
However, the region of homology is very small. The beetle Iuciferases contain an area of 
5 low sequence homology which may bind the benzothiazole luciferin. This low homology 
may account for the different colours of beetle bioluminescence. used a luciferin analogue 
(2-(4-benzoylphenyl) thiazole-4-carboxylic acid which photo inactivated the luciferase active 
site of the firefly Photinus pyralis. This photoinactivation was directly linked to the 
degradation of a small peptide sequence HHGF (SEP ID NO: 341 (residues 244-257). This 

10 is therefore postulated as a luciferin-binding site. The cloned protein does not have any 
sequence homology with these putative binding regions. Two strongly conserved regions of 
amino acids have also been found in both the luciferase and the luciferin binding protein of 
the dinoflagellate Gonyaulax polyedra. These regions were compared to the cloned protein, 
but no sequence similarity was found. No sequence identity could be established between 

1 5 the bacterial Iuciferases and the cloned protein. 

Therefore, the present invention provides cloned apophotoprotein apopholasin (and the 
cDNA coding therefor), which has identical properties to native (but non-glycosylated) 
apopholasin with respect to molecular weight, amino acid composition, potential for 
20 glycosylation, its highly acidic pi and its cellular location. Hence, the present invention 
can further provide the corresponding BOIP or modified BOIP, according to standard 
methods. 

The corresponding BOIP is preparable by bringing the apophotoprotein pholasin into 
25 association with the luciferin, also using standard methods. Although the luciferin is 
tightly bound in the native pholasin BOIP, it has been found that it may not be the case in 
the recombinant pholasin BOIP; indeed the luciferin may be weakly bound or merely 
present with the apoprotein. For example, a methanol, aqueous, acidic or other extract of 
Pholas dactylus (whole organism or light organ dissected from the animal) containing the 
30 'luciferin', or the pure luciferin, may be added to the solution, cell or organism__(Figtire_l_0 
shows the time course of hypo-chlorite triggered luminescence in these circumstances). A 
time course of apopholasin reactivation was performed by incubating partially purified 
recombinant pholasin secreted by insect cells with acidimenthanol extract of Pholas 
dactylus (•) for 0 (solid line), 1 (---), 2 ( ), 6( ) or 24 hours ( ), or incubated 



without extract (o). Controls of buffers only with no protein or acidTmethanoI extract (□) 
and extract alone (■) were treated in identical conditions. Chemiluminescence was 

triggered by the addtion of 2% soidum hypochlorite at 10 seconds (arrow) and is shown as 
chemiluminescent counts minus background light. A typical curve obtained by 
5 hypochlorite triggering of native pholasin is also shown (X). A representative experiment 
carried out in duplicate is shown. 

The luciferin associates with the apo-BOIP forming the photoprotein or remains loosely 
bound so that it turns over like a luciferase.. The luciferin on the photoprotein then reacts 
10 with oxygen or one of its metabolites to produce light, in the presence or absence of the 
luciferase. The light emission may be detected, quantified, or imaged using a 
luminometer, photographic film or imaging camera, or by the naked eye. Alternatively, 
light emission may be generated spontaneously by intra- or extra-cellular metabolites 
reacting with the apo-BOIP, 

15 

Although illustrated with respect to pholasin, the following may apply to any BOIP: the 
BOIP can be produced directly from native DNA, or from DNA engineered or amplified 
by the polymerase chain reaction. By this means, sites can be inserted within the protein 
by splitting the DNA into two or more pieces, or by adding DNA sequences to the 5' or 3' 
20 ends. For example, the DNA may be expressed in bacteria, yeast, an insect or human cell, 
or other suitable organism to produce protein which can be extracted and used. 

In this instance, the protein produced from the cloned DNA reacts with oxygen or a 
metabolite of oxygen, such as the superoxide anion (0 2 ")> hydrogen peroxide (H 2 0 2 ), 

25 hydroxy radical (OH ), an oxyhalide anion (OCf, OBr , Of, OSCN"), nitric oxide (NO), an 
organic hydroperoxide or a radical ROO . The change in light emission enables the 
oxygen or metabolite(s) to be detected and quantified in live cells, organelles, or on the 
outer or inner surface of the plasma membrane, or within an organ of a live organism 
without the need to break them open or the need to separate bound and free fractions. 

30 This also enables an enzyme producing oxygen or one of its metabolites, such as 
chlorophyll, or enzymes such as oxidases and oxygenases which react directly with 
oxygen or one of its metabolites to attach oxygen to the substrate to be detected and 



1 1 

quantified in live cells, organs and whole organisms, or extracts from any one of these. 

Also the BOIP can be made in vitro by transcription/translation in a cell lysate such as 
rabbit reticulocyte lysate or wheat germ extract containing RNA polymerase. The DNA 
5 for the BOIP is first engineered to contain an RNA polymerase promoter, such as T7, 
SP6; bacterial promoter(s), such as lac, ara or trp; or mammalian promoter(s), such as 
actin, myosin, myelin proteins, TK, MRT-V, SV40, CMV, RSV, metallothionine, 
antibody, G6P dehydrogenase, and can be amplified in vitro using the polymerase chain 
reaction. A poly-A tail may be added at the 3' end and a tissue specific promoter or 
10 enhancer sequence added to the 5' or 3' end of the DNA coding for the BOIP or modified 
BOIP, enabling it to be expressed specifically in a target cell, such as a myocardial cell or 
a cancer cell. The expression of the BOIP in the target cell is detected and quantified by 
light intensity, colour or polarisation, as previously mentioned. 

1 5 The BOIP, or its DNA or RNA, may be incorporated into a live bacteria or eukaryotic cell 
using phage, virus, plasm id, calcium phosphate transfection, electroporation, liposome 
fusion, membrane pore forming proteins, micro-injection or DNA gun. Once inside cells 
or an appropriate extracellular environment, cell activation or injury will initiate or 
change the light emission from the BOIP. For example, expression in live organisms by 

20 micro- injection of protein, RNA or DNA, or by transgenic manipulation to produce a cell, 
such as a bacterial, microbial, animal or plant cell, eg a white blood cell, a heart cell, or a 
yeast, protozoan, fruit fly (Drosophila), nematode worm, polychaete worm, fish, human, 
mouse, rat, sheep, pig, horse or plant, which can generate its own light. 

25 

The BOIP can also be incorporated into a defined part of a live cell by chemical means or 
by genetically engineering the BOIP to contain a signal peptide, which locates the BOIP 
to the inner or outer surface of the plasma membrane or within a particular organelle, such 
as peroxisome, mitochondrion, chloroplast, tonoplast, endoplasmic, reticulum, Golgi 
30 apparatus, endosome, lysosome, secretory vesicle, nucleus, nucleolus, nuclear membrane, 
plasma membrane, proteosome, or gap junction, or structure such as membrane receptor 
ion channel microtubule, cytoskeleton, nuclear skeleton, nuclear receptor, mitotic spindle 
or microfilaments. The signal peptide, added either chemically or genetically, will 
normally target the normal or modified BOIP to a particular intra- or extra-cellular site. 
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For example, the sequence MLSRLSLRLLSRYLL (SEQ ID NO: 35) or part of 
cytochrome oxidase on the N-terminus will target the BOIP to the mitochondrion; 
KKSALLALMYVCPGKADKE (SEQ ID NO: 36) or MLLPVPLLLGLLGLAA (SEQ ID 
NO: 37) at the N-terminus will target the BOIP to the endoplasmic reticulum, a KDEL 
5 (SEQ ID NO: 38) or HDEL fSEO ID NO: 39) or KEEL sequence (SEP ID NO: 40) at the 
C-terminus retaining it there. SKL at C-terminus targets BOIP to the peroxisome; 
PKKKRKV (SEQ ID NO: 41) or an extension of this SV40 large T-antigen signal will 
target it to the nucleus; and a palmitoylation and/or a myristoylation signal will target it 
to the plasma membrane. By coupling the BOIP to another protein that targets itself to a 
10 particular site, the BOIP can also be targeted there. For example, coupling the nuclear 
proteins nucleoplasmin or lamin B receptor to BOIP targets it to the nucleus; cytochrome 
oxidase at the N-terminus targets BOIP to the mitochondria; chlorophyll at the N- 
terminus targets BOIP to the chloroplast; a connexin at the N-terminus targets BOIP to 
the gap junction or plasma membrane; and SNAP 25 to the plasma membrane. 

15 

Other modifications to the apoprotein, BOIP, or nucleotides coding therefor include, but . 
are not limited to: 

The apoprotein, such as apopholasin, may also be glycosylated, and used to detect and 
20 quantify secretion or movement of proteins through the secretory pathway. 

Nucleic acid coding for the BOIP when expressed inside a live cell may not only be 
modified but also regulated in this cell by gene expression, such as by promoters, 
enhancers or oncogenes. For example, the apoprotein, such as apopholasin, may be 
25 coupled to a gene regulator protein, such as a transcription factor, by genetic or chemical 
manipulation, such that the movement through a cell or of the regulator protein or its 
activity, can be detected or quantified. 

The BOIP or apoprotein, or its DNA may be linked to another protein or DNA used in 
30 therapy, such that the other protein or DNA can be detected in live cells or in a whole 
organism, eg a human. 



The apoprotein, such as apopholasin, can also engineered genetically or chemically to 
contain a site or sites which can be covalently modified by enzymes such as 



phosphorylation (incl 




ser/thr, his and tyr kinases 
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phosphatases), 



tranglutamination, proteolysis, ADP ribosyiation, gly-or glu-cosylation, halogenation, 
oxidation, methylation, palmitoylation, myristylation and farnesylation. 

5 The apoprotein, such as apopholasin, can be engineered genetically or chemically to 
contain an antigen or intracellular signal binding site, such as Ca 2+ , cyclic AMP, cyclic 
GMP, cyclic CMP, IP 3 , 1P 4 , diacyl glycerol, ATP, ADP, AMP, GTP, or any oxy- or 
deoxy-ribonucleoside or nucleotide, a substrate, a drug, a nucleic and/or a gene regulator 
protein. 



The BOIP may also be converted to a rainbow protein by engineering a particular site 
such as described hereinabove into the BOIP, at the N- or C -terminus, or between a 
chimera of the BOIP and an energy transfer acceptor, such as GFP (wild type or any of the 
mutant GFPs). This is known as chemiluminescence, bioluminescence or fluorescence 

15 resonance transfer (CRET, BRET or FRET, respectively). Conversion of the BOIP to a 
'rainbow protein' may be effected by reaction with a cellular substance, modification 
genetically or chemically, or by linking the BOIP to a fluor, such as the green fluorescent 
protein or the red fluorescent protein in the deep sea fish Malacosteus. The result is a 
BOIP which changes its colour and/or intensity and/or polarisation of emission. The 

20 change in colour occurs by energy transfer, eg resonance transfer (CRET or FRET) or 
electron transfer. 

The initial (unmodified) BOIP may be the apophotoprotein, its DNA or RNA, from the 
bivalve mollusc Pholas dactylus, or it may be another BOIP, such as one from the 
25 mollusc Rocellaria grandis or the squid Ommastraphes, or earthworm luciferase, which 
produce light with oxygen metabolites in a way very similar to Pholas dactylus. 

The BOIP, apo-BOIP, or nucleic acid coding for it, whether modified or not, may 
therefore be used in a range of biology and investigations such as: 



(a) Detection, location and measurement of signals in substrates, such as live cells, 
organs or organisms, or in extracellular fluids; 

(b) Detection, location and measurement of oxygen and its metabolites in substrates, 
such as live cells, organs or organisms, or in extracellular fluids, water (sea and 
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fresh), soil or the atmosphere; 

(c) Detection and location of normal cells such as microbes (protozoa, yeast, fungi, 
moulds, bacteria, viruses); 

(d) Detection and location of abnormal cells, such as cancer cells, hyperactive cells in 
5 rheumatoid arthritis and other inflammatory diseases, cells infected with a 

pathogen, such as a virus or other infectious agents, cells damaged by physical, 
chemical or biological attack, cells damaged by perfusion or reperfusion injury or 
cells damaged by oxygen or one of its metabolites; 

(e) Measurement and location of enzymes, particularly those producing oxygen or its 
1 0 metabolites, and other tumour reactions in cells or biological fluids; 

(f) DNA and RNA binding assays; 

(g) Immunoassay and other protein binding assays; 

(h) In genetic engineering, in the development of transgenic animals and plants, and 
microbes; in horticulture; agriculture; medicine and veterinary medicine; and/or 

1 5 (i) in genetic entertainment by incorporation into light sticks, greeting cards or toys to 
produce light of various colour, intensities, oscillations, flashes and glows; or in 
comestibles, such as food, drinks, including beers, wines, spirits, colas and other 
soft drinks. 



20 

Accordingly, the present invention further provides an apoprotein, such as pholasin 
apoprotein (or apopholasin) in both unglycosylated and glycosylated forms, and a BOIP 
thereof, such as pholasin, either alone (but excluding native proteins per se that have 
already been isolated, such as native pholasin per se) or in association with one or more 

25 of: a targeting or signal peptide; a glycosylate; a site capable of modification by an 
enzyme; an antigen or intracellular signal binding site; a promoter, an enhancer or an 
oncogene or a pharmacologically active substance; or the like. The present invention 
further provides a recombinant construct comprising a nucleic acid sequence encoding for 
any of these proteins; a vector containing a nucleic acid sequence encoding for any of 

30 these proteins; a host transformed by such vector; a live cell, such as bacterial, insect, 
eukaryotic, prokaryotic, archae or plant cells containing or expressing any of these 
proteins; and a rainbow protein, as described herein, together with a nucleic acid 
sequence encoding therefor. 



The present invention wfPKow be illustrated with reference to the^Kwing non-limiting 
examples, in which the methodology referred to is known to those skilled in the art and/or 
may be carried out by analogy with reference to the protocols disclosed in the following 
references, the contents of which are herein incorporated by reference in their entirety: 
5 BOOKS 

1. Campbell,AK. (1988). Chemiluminescence: principles and applications in biology 
and medicine, pp608. Horwood/VCH, Chichester and Weinheim. 

2. Campbell,AK. (1994). Rubicon: the fifth dimension of biology, pp 304. Duckworth, 
London. 

10 PAPERS 

1. CampbelI,AK Patel,A. (1983) Biochem J. 216:185-194. A homogeneous 
immunoassay for cyclic nucleotides based on chemiluminescence energy transfer. 

2. Roberts,PA Knight,J Campbell,AK. (1987) Anal. Biochem. 160:139-148. Pholasin - a 
bioluminescent indicator for detecting activation of single neutrophils. 

15 3. Mueller,T Davies,EV Campbell,AK. (1989) J. Biolum. Chemilum. 3:105-113. 
Pholasin Chemiluminescence Detects Mostly Superoxide Anion Released from 
Activated Human Neutrophils. 

4. Mueller,T Campbell,AK. (1989) J. Biolum. Chemilum. 5:25-30. The Chromophore of 
Pholasin: A Highly Luinescent Protein. 

20 5. Sala-Newby,G and Campbell,AK (1991) Biochem.J. 279:727-732. Engineering a 
bioluminescent indicator for cyclic AMP dependent protein kinase . 

6. Campbell, AK, Trewavas, AJ and Knight, MR (1996). Calcium imaging shows 
differential sensitivity to cooling and communication in luminous transgenic plants. 
Cell Calcium 19:211-218. 

25 7. Kendall, JM, Sala-Newby, G Badminton, M, Campbell AK and Rembold, CR (1996). 

Free Ca 2+ in the endoplasmic reticulum of living cells measured using aequorin as a 
pseudo luciferase. Biochem.J 318:383-387 

8. Badminton, MN, Campbell, AK and Rembold, CR (1996). J.Biol. Chem. 271:31210- 
31214. Differential regulation of nuclear and cytosolic Ca 2+ in HeLa cells. 

30 9. Sala-Newby, GB, Taylor, KT, Badminton, MN, Rembold, CR and Campbell, AK 
(1998). Imaging bioluminescent indicators shows Ca 2+ and ATP permeability 
thresholds in, live cells attacked by complement. Immunology. 93:4:601-609 

KKSaSala-Newby, GB, Kendall, JM, Jones, H, Taylor, KM, Badminton, MN, Llewellyn, 
DH and Campbell, AK (1999). Targeting bioluminescent proteins to defined 
35 compartments fo living cells. Methods in Enzymology Bioluminescence and 
Chemiluminescence. ed Ziegler, M and Baldwin, T. 
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EXAMPLE 1 : Production of a BOIP in bacteria 

c or genomic DNA coding for apopholasin, with or without the cDNA coding for the signal 
peptide, is amplified by PCR with restriction sites such as BamHl at each end. The cDNA is 
5 run on an agarose gel and the full length DNA eluted and purified. The DNA is then cut 
with BamHl to generate sticky ends and ligated into an expression plasmid such as pET3a, 
which has been cut with BamHl also. After ligation the sealed plasmid is transformed into a 
standard E.coli K12 strain such as JM109, a colony picked off for a large plasmid 
preparation. After checking that the plasmid contains the correct sequence for apopholasin 

10 and is in the correct orientation the plasmid is then used to transform a standard expression 
strain of E.coli such as BL21(DE3) or other expression strain. A colony is picked off the 
agar plate and grown up for 2h in standard LB broth. IPTG is added as inducer for a further 
2h. Apopholasin can then be extracted by breaking the bacteria by lysozyme digestion or 
sonication in a standard salt medium such as 50mM HEPES pH 7 +/- ImM ascorbate. Since 

15 the apopholasin is unglycosylated it tends to aggregate and form inclusion bodies. These 
can be broken using 8M urea or guanidinium chloride and these then dialysed off. If the pH 
of PAGE gels is alkaline this also tends to allow aggregation of both the unglycosylated and 
glycosylated apo- and full pholasin. A signal peptide such p-lactamase signal will target the 
BOIP to the periplasmic space, resulting in the ability to secrete the expressed protein from 

20 the external fluid of the cells. 

EXAMPLE 2: Production of a BOIP in insect cells 

c or genomic DNA coding for apopholasin is inserted into a plasmid suitable for 
25 conversion into baculovirus when transfected into insect cells. Since pholasin is secreted 
by Pholas itself there is a signal peptide at the N-terminus. Removal of this by PCR will 
allow cytosolic expression in insect cells. If the signal peptide is left on or changed for 
honey bee mellitin signal peptide, the apopholasin is secreted into the external medium. 
The virus containing the DNA for apopholasin is then purified and stored until required. 
30 An aliquot is then added to fresh insect cells and these incubated for 3-7 days. The 
apopholasin is then isolated from the supernatant if a signal peptide is used, or from the 
cells is not. The apopholasin can then be purified by ammonium sulphate precipitation, 
gel filtration and DEAE chromatography. The state of glycosylation can be assessed by 
running the protein on PAGE when the molecular weight is 34Kda. Removal of the 
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glycosylation by enzymes returns the protein to the size of apopholasTrT23.5Kda. It can be 
stored frozen or freeze dried, and activated to form pholasin by addition of luciferin as 
described in Example 3. 



5 Since the apopholasin tends to aggregate in the insect supernatant it is important to get the 
protein into non-aggregating buffer, e.g. 50mM HEPES pH 6, 1-1 OmM ascorbate, as soon 
as possible. 

Formation of pholasin can then be achieved as described in Example 3. 

10 

EXAMPLE 3: Generating pholasin and light emission 

To generate light the apopholasin must first be converted into pholasin with the luciferin. 
The luciferin can be extracted from native pholasin by mild acid, or by methanol, mild 
15 acid or alkaline treatment of light organs isolated from Pholas dactylus or the whole 
organism. After homogenisation the extract is centrifuged or filtered to remove 
particulate material. Further purification can be achieved by tic of hplc. The luciferin is 
best stored dry, but can be stored at -70°C. The intactness and concentration can be 
estimated by measuring the absorbance or fluorescence. The details are as follows: 

20 

(a) Isolation of the luciferin 

Four protocols (1-4) have been developed to extract and isolate the luciferin 
responsible for light emission in pholasin. The luciferin is a small organic moiety 
tightly bound to apopholasin when pholasin is isolated from Pholas dactylus, but 
25 also can be found not bound to apopholasin. Thus the extraction procedure isolates 

either form of luciferin. 

1. The organism Pholas dactylus or its light organs are homogenised in 
50mM sodium phosphate pH 6.0 on ice. The pholasin is precipitated with 
saturated ammonium sulphate (4°C stirred), and then removed by 
30 centrifugation at ca 15,000g for 30min in the cold. The supernatant is then 

passed down a SEP-PAK silica column, which binds the luciferin. The 
column is washed with 5ml ethyl acetate and then 5ml of methanol. The 
active fractions containing the luciferin are assayed either by reactivation 
of the apopholasin or by chemiluminescence in DMSO, DMF, or NaOCl. 



w 

The luciferin is concentrated and can be purified further on tic or hplc with 
a standard solvent. It is dried and stored at -70°C. 

2. The organism Pholas dactyius or its light organs are homogenised in cold 
acetone on ice, filtered through a Buchner funnel, and extracted with 
methanol :acetone (1:1), the residual powder being extracted 3 times with 
methanol and extracts combined. These are then concentrated in a 
Rotavaporator and left to stand for Ih on ice to allow further precipitation. 
The suspension is then refiltered and concentrated. The solution containing 
the luciferin is then passed down a SEP-PAK silica column which binds 
the luciferin. The column is washed with 5ml ethyl acetate and then 5ml of 
methanol. The active fractions containing the luciferin are assayed either 
by reactivation of the apopholasin or by chemiluminescence in DMSO, 
DMF, or NaOCl. The luciferin is concentrated and can be purified further 
on tic or hplc with a standard solvent. It can be dried and stored at -70°C. 

3. The organism Pholas dactyius or its light organs are homogenised in cold 
acetone on ice, and filtered through a Buchner funnel to give an acetone 
powder. This is then extracted with methanol: acetone (1:1), twice for 
lOmin and then 3 times with methanol. The extracts are combined and 
concentrated in a Rotavaporator. They are left to stand for lh on ice to 
allow further precipitation, refiltered and concentrated. The residual 
powder is resuspended in 50mM sodium phosphate pH 6.0, lOmM 
ascorbate, and ultrafiltered with a 10kD Amicon membrane at 4 C for 
pholasin. The solution containing the luciferin is then passed down a SEP- 
PAK silica column which binds the luciferin. The column is washed with 
5ml ethyl acetate and then 5ml of methanol. The active fractions 
containing the luciferin assayed either by reactivation of the apopholasin or 
by chemiluminescence in DMSO, DMF, or NaOCl. The luciferin is 
concentrated and can be purified further on tic or hplc with a standard 
solvent. It is dried and stored at -70°C. 

4. The organism Pholas dactyius or its light organs are homogenised in 
50mM HEPES buffer, with methanol and lOOmM HC1 on ice, and 
incubated for 2h on ice. After centrifugation at ca 15,000g for 30min in the 
cold, the supernatant is then passed down a SEP-PAK silica column which 
binds the luciferin. The column is washed with 5ml ethyl acetate and then 
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5ml of methanol. The active fractions containing the luciferin are assayed 
either by reactivation of the apopholasin or by chemi luminescence in 
DMSO, DMF, or NaOCI. The luciferin is concentrated and can be purified 
further on tic with a standard solvent. It is dried and stored at -70°C. 
5 Method 4 normally generates most luciferin. The luciferin is characterised by its 

absorbance and fluorescence spectrum, and by its chemiluminescence with 
DMSO, NaOCI and apopholasin. 
(b) Generation of pholasin from apopholasin and the luciferin 

A small sample of the luciferin (l-10ul) is added to apopholasin in an appropriate 
10 buffer (50mM HEPES pH 6-7.5, +/- 0.1% gelatine. +/- 1-lOmM ascorbate, or 

500mM NaCI, lOmM TES, ImM EDTA, ImM mercaptoethanol pH 6-7.5). The 
mixture is incubated at room temperature for up to 24h, and the pholasin assayed 
by adding an oxygen metabolite, e.g. NaOCI, or luciferase to a sample. When 
apopholasin has been expressed in cells, the luciferin is added externally, 
15 microinjected into individual cells or added via liposomes to get the luciferin into 

the cell. 



Light is detected and quantified in a standard luminometer, imaging camera 
(intensified or CCD), or by a silicon chip. 

20 

EXAMPLE 4: Production of a BOIP in vitro 

c or genomic DNA coding for apopholasin, with or without the signal peptide, is 
amplified by PCR with the 5' primer containing the DNA coding for T7 RNA 

25 polymerase. The DNA product is purified and precipitated. After dissolving in lOmM 
tris/lmMEDTA pH7, the DNA is added to a standard in vitro transcription/translation 
system such as rabbit reticulocyte lysate or wheat germ agglutinin and incubated at 30°C 
for 30-60min. The apopholasin can then be purified and activated to form pholasin as 
described in Example 3. 

30 EXAMPLES: Targeting a BOIP in vitro 

The BOIP can also be incorporated into a defined part of a live cell by chemical means or 
by genetically engineering the BOIP to contain a signal peptide which locates the BOIP to 
the inner or outer surface of the plasma membrane or within a particular organelle such as 



# • 

>noHon, chloroplast, tonoplast, endoplasmic i 



peroxisome, mitochoncWon, chloroplast, tonoplast, endoplasrf^r reticulum, Golgi, 
endosome, lysosome, secretory vesicle, nucleus, nucleolus, proteosome, or gap junction, 
or structure such as microtubule, cytoskeleton, nuclear skeleton, nuclear receptor, or 
mitotic spindle. The signal peptide, added either chemically or genetically, will normally 
5 target the norma] or altered BOIP to a particular intra- or extra-cellular site for example, 
the sequence MLSRLSLRLLSRYLL (SEP ID NO: 35) or part of cytochrome oxidase on 
the N-terminus will target the BOIP to the mitochondrion; 
KKSALLALMYVCPGKADKE (SBQ ID NO: 36) or MLLPVPLLLGLLGLAA (SEP ID 
NO: 37) or the ER protein calreticulin at the N-terminus will target the BOIP to the 

10 endoplasmic reticulum, a KDEL (SEQ ID NO: 38) or HDEL fSEQ ID NO: 39) sequence 
at the C-terminus retaining it there. SKL at C-terminus targets BOIP to the peroxisome, 
PKKKRKV (SEP ID NO: 1 0 or an extension of this SV40 large T-antigen signal will 
target it to the nucleus, and a palmitoylation and/or a myristoylation signal 
(MGCVCSSNPD (SEP ID NO: 42) = the LCK N-terminal acylation motif from tyrosine 

15 kinase) will target it to the plasma membrane. By coupling the BOIP to another protein 
which targets itself to a particular site then the BOIP is also targeted here. For example, 
coupling the nuclear proteins nucleoplasm in or lamin B receptor to BOIP targets it to the 
nucleus; cytochrome oxidase at the N-terminus targets BOIP to the mitochondria; 
chlorophyll at the N-terminus targets BOIP to the chloroplast; and a connexin at the N- 

20 terminus targets BOIP to the gap junction or plasma membrane, SNAP 25 to the plasma 
membrane. 



In order to target pholasin to defined sites in living cells, the DNA coding for these 
targeting sequences are added by using PCR. For cytosolic apopholasin the native signal 

25 peptide is removed and also the BOIP can be linked to larger proteins at the N- or C- 
terminus such as firefly luciferase or aequorin to prevent it getting into the nucleus. This 
also enables ATP and oxygen metabolites, or Ca 2+ and oxygen metabolites to be measured 
simultaneously in the same cells by intensity, colour or polarisation of the different 
bioluminescent indicators. A multiple bioluminescent indicator can also be engineered by 

30 PCR, or by using restriction enzyme sites, from the DNA coding for 3 or more 
bioluminescent proteins. A simple screen of the transformed bacteria enables the multiple 
rainbow protein to be isolated with 2-3 colours or more. 
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The DNA is then addeTTto an in vitro transcription/translation system as described in 
Example 4 in the presence of the organelle to be targeted (e.g. microsomes for the 
endoplasmic reticulum, which glycosylate apopholasin). 

5 The new DNA can also be inserted into a plasm id by standard techniques, and 
transformed into bacteria or transfected or injected into eukaryotic cells such as HeLa or 
COS. 

Addition of the luciferin as described in Example 3 allows formation of pholasin which 
10 can then be detected by light emission. Changes in oxygen metabolite production are then 
be detected by a luminometer or imaging camera when the cells are exposed to external 
oxygen metabolites, a change in oxygen concentration, addition of stimuli e.g. TNF, EGF, 
hormones or drugs, or attack by pathogens such as bacteria, viruses, complement, 
antibodies, toxins, and cells of the immune system. 



EXAMPLE 6: Engineering a covalent modification site into a BOIP 



(a) The site coding a protein kinase A (RRAS or kemptide), protein kinase C 
(MARCKS), MAP kinase, ERK, the ER - nuclear signalling kinase [REIP or a 
20 phosphatase is added to the N- or C-terminus or inserted at various sites within the 

apopholasin by PCR and expressed as described in Examples 1-5. Pholasin is then 
generated by addition of the luciferin as described in Example 3. 

Addition of the catalytic subunit for protein kinase A, or activation via cyclic 
25 AMP inside cells, leads to phosphorylation or dephosphorylation of the modified 

pholasin and change in light emission (intensity, colour or polarisation). 

A preliminary screen is necessary to select the appropriate proteins and to discard 
any which have lost all activity. 



(b) The site coding a protease (thrombin, enterokinase, HIV protease, caspase) is 
added to the N- or C-terminus of the apopholasin by PCR or inserted at various 
sites within the protein, and expressed as described in Examples 1-5. Pholasin is 
then generated by addition of the luciferin as described in Example 3. 



EXAMPLE 7: Engineering a BOIP into a "Rainbow Protein" 

cDNA coding for apopholasin is linked to another protein by using the cDNA coding for 
5 that protein. For example, wild type GFP, the S65T mutant of the green fluorescent 
protein, YGFP, or EGFP-_are linked to apopholasin by PCR at the N- or C -terminus, or by 
splitting one or both proteins using multi-step PCR. In between there is a 'reactive' 
peptide with a protease site (a thrombin or enterokinase) and a binding site for IP3, or the 
15 amino acid sequence form IP3 kinase (an IP 4 binding site). At the C-terminus of the 

10 GFP, a peptide containing 6 lysine residues may also be added via PCR. The protein is 
expressed and fluorescein covalently linked to these lysines by addition of fluorescein 
isothiocyanate. Addition of the luciferin forms pholasin as described in Example 3. The 
change in colour occurs by chemiluminescence resonance energy transfer. Without 
fluorescein the rainbow protein emits blue-green light (508nm), which changes to blue 

15 (490nm) when the reactive substance binds to the reactive peptide, or when either 
thrombin or enterokinase is added. When the 6 amino acid linker is used the colour starts 
as green (530nm), and will then change from green, to blue-green and then blue as the 
particular reactive sequence binds their respective analytes. Use of rhodamine instead of 
fluorescein generates a rainbow protein which changes from red to green to blue. 

20 

A preliminary screen is necessary to select the appropriate rainbow proteins and to discard 
any which have lost all activity. 

The other protein linked to apopholasin may be, for example, any one of the following 
25 linked chemically or genetically: 

1. Firefly or any benzothiazole luciferase to the N or C terminus gives two colours 
for ATP and oxygen metabolites. 

2. Any imidazolopyrazine luciferase, including coelenterazine systems - decapod 
30 shrimp, fish, sqiud, Renilla, anthzoan, Chaetognate, radiolarian, or copepod and 

Vargula systems - ostracod, Porichthys and similar fish, cypridinids and Vargula. 

3. Any tetrapyrrole luciferase such as dinofl age Hate, euphausiid or stomiatoid fish. 

4. Bacterial luciferase and other aldehyde or flavin luciferases, including polychate 
worm. 



y wild type, S65T, enhanced GFP, blue C 



5. Any GFP, including wild type, S65T, enhanced GFP, blue GFP, yellow GFP, 
Renilla GFP, Ptilocarpus GFP, and Pennatula GFP, any anthozan GFP, or any 
coelenterate GFP. 

6. The red fluorescent proteins from stomiatoid fish - Malactosteus, Aristostomias, 
5 Photostomias. 

1. The phycobiliproteins - phycoerythrin and phycocyanobilin. 

8. The blue fluorescent lumaztne protein in the bacterium Photobacterium. 

9. The yellow flavin fluorescent protein in Y Vibrio. 

10. Any lysine or argininine or other amino acid side chain where a fluor can be added 
10 covalently. In which the case the rainbow protein amy emiot more than two 

colours. For example, rhodamine on a pholasin-linker-GFP chimera will turn from 
red to green to blue. 



A preliminary screen may be necessary to select chimeras which have not lost all 
15 bioluminescent activity. 



The 'reactive' peptide may be a binding site for any analyte, protein or DNA, metabolite, 
substrate vitamin, an enzyme such a protein kinase C or phosphatase, ion channel, ion 
pump, antigen, antibody, nucleotide or nucleoside such as ATP, GTP, ADP, AMP, 
20 adenosine, cAMP, cGCMP, cCCP or their deoxy equivalents, and inositol phosphates 
such as IP3 or IP 4 , a lipid such as diacyi glycerol, phosphatidyl inositol bisphosphate, 
phosphate, a cation such as Ca 2+ , K + or Na + , Cu 2+ or Zn 2+ , or anion such as Cf , sulphate, 
or gas such as NO, O2 or H 2 , or a protein binding site such as calmodulin, kinesin, dynein, 
tubulin, or myosin. 

25 

When pholasin is triggered by oxygen metabolites, the Photos luciferase or peroxidase, 
energy transfer occurs from pholasin oxyluciferin through GFP to fluorescein resulting in 
a yellow emission. Addition of thrombin for 3h cleaves the GFP-fluorescein from the 
pholasin and the light emission returns to the blue of native pholasin. Addition of IP 3 to 
30 the full chimera alters the efficiency of energy transfer. As a result there is a change in the 
ratio of light emitted in the yellow to blue. This ratio is directly related and can be plotted 
against the concentration or amount of analyte. The light is detected in a dual wavelength 
luminometer or ratiometric imaging camera and the ratio of blue to green light measured. 
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Alternatively any fluors^mbe used, and any binding sites with the^Pft characteristics as 
shown in these examples will work provided a simple screen is used to select the right 
chimeras. 

5 EXAMPLE 8: Engineering a BOIP into a "Rainbow Protein" for two analytes 



Apopholasin is linked to firefly luciferase by using cDNAs and PCR, followed by 
expression in insect cells as described in Example 2. Addition of the luciferin as 
10 described in Example 3 generates the pholasin. In the presence of firefly luciferin (1 mM), 
ATP and oxygen metabolites, this chimera emits blue and yellow simultaneously which 
can be independently measured by using a dual wavelength luminometer or imaging 
camera. 

15 EXAMPLE 9: Expression of BOIPs in mammalian cells 

Apopholasin, c or genomic, in an expression plasmid with the CMV promoter, is 
transfected into HeLa cells. After incubation for 3 days to allow expression of the 
apopholasin, the luciferin is added to form pholasin. Expression is checked using a 
20 polyclonal antibody to pholasin raised in rabbits. Addition of oxygen metabolites outside 
the cell allows the permeability of the plasma membrane to oxygen metabolites to be 
assessed. As the oxygen metabolites permeate into the cytosol, the light emission 
increases. 



together 



25 



1-:AVT'5HS P-EGSVWJNWBkS HAVC-McWrDOC 



EXAMPLE 10: Expression of BOIPs in plants 

c or genomic DNA coding for apopholasin is inserted into a plasm id with the cauliflower 
mosaic virus promoter and transformed into Agrobacterium tumificans. These are then 
5 added to a tobacco leaf, seedlings generated, and those expressing apopholasin selected. 
The plants are grown to seed, and seedlings grown from this seed. Addition of luciferin 
forms the pholasin as described in Example 3. Stressing the plant, e.g. with wind, touch, 
cold, or peroxide, or during growth and development or by a hormone, generates light, 
showing the formation of oxygen metabolites within the live plant. A cell-specific 
10 promoter engineered on to the apopholasin cDNA before making the transgenic plant 
enables oxygen metabolites to be detected in specific cells within the whole, living plant. 

EXAMPLE 11: Detection of oxidative damage in vitro 

1 5 Addition of pholasin to serum or plasma from a rat, mouse or human enables oxygen 
metabolites to be detected and measured on addition of a drug or other substance of 
interest. 

EXAMPLE 12: Detection of ROMs in a heart cells 

20 

Reperfusion has been proposed to lead to oxygen metabolite damage in cardiac myocytes. 
Pholasin allows this to be tested for the first time. Plasmid containing apopholasin cDNA 
and the CMV promoter is transfected into isolated cardiac myocytes in culture. 
Expression occurs within 1-3 days, and pholasin is formed by addition of the luciferin as 
25 described in Example 3. Subjecting the cells to hypoxia followed by readmission of 
normal oxygen leads to light emission, showing that oxygen metabolites have been 
generated inside the cells. By using an imaging camera, the digital or analogue nature of 
this can be assessed as the number of cells emitting light can be visualised and counted. 

30 
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EXAMPLE 13: Detection of ROMs in the nucleus and endoplasmic reticulum (ER) 

Plasmid-containing apopholasin cDNA with either nucleoplasm in DNA or calreticulin 
DNA (with or without KDEL on the C-terminus) linked to the pholasin DNA, to target 
5 the apopholasin to the nucleus or ER respectively, and the CMV promoter for expression, 
is transfected into HeLa cells in culture. Expression occurs within 1-3 days, and pholasin 
is formed by addition of the luciferin as described in Example 3. Addition of oxygen 
metabolites outside the cells, or hypoxic/oxygen shock generates light measured in a 
luminometer, showing how fast oxygen metabolites penetrate into these organelles. By 
10 imaging with a photon counting imaging camera, the number of cetls permeable to 
oxygen metabolites can be counted. Location of the pholasin can be assessed by imaging 
live cells, or by using immunofluorescence with the pholasin antibody on partially-fixed 
cells or GFP-pholasin in live cells. Using a rainbow protein, two or more analytes can be 
detected together. 

15 

EXAMPLE 14: Use of pholasin as a protein label 

Pholasin can be used as a label in homogeneous or heterogeneous immunoassay. 
Apopholasin is first covalently linked to an antibody to HIV, and pholasin formed by 
20 addition of luciferin as described in Example 3. The antibody is then used in a standard 
chemiluminometric immunoassay format. Addition of HIV antigen leads to an increase in 
antibody binding and an increase in light emission dependent on the amount of HIV 
added. The amount of HIV in a blood sample can be assessed by relating the pholasin 
light emission in the sample to the standard curve. 

25 

EXAMPLE 15: Pholasin as a DNA label 

Apopholasin is covalently linked to an oligonucleotide probe for detecting the presence of 
the cystic fibrosis gene. Addition of the probe to DNA in a standard Southern blot allows 
30 the probe to bind when the gene is present. Addition of luciferin as described in Example 
3 allows the pholasin to form. Addition of hypochlorite (lOmM) in barbitone buffer pH 9 
causes the pholasin to flash and the gene can be visualised by the photon counting 
imaging camera. 
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EXAMPLE 16: PholasWTn a two hybrid system 



Protein -protein interaction can be detected by engineering apopholasin to one half of a 
two hybrid system and GFP to the other. Binding will allow the yeast to grow. 

5 

EXAMPLE 17: Pholasin in genetic entertainment 

Pholasin is able to chemiluminesce at a wide range of pH (3-10), including acid pH such 
as 3-4. Thus it can be added to drinks such as beer, cola, soft drinks, and spirits to make 
10 them glow. It can also make food glow by adding to them to the ingredients of cakes, 
icing, popcorn; by painting the pholasin or apopholasin on to the food, or by making it 
genetically in the source of the food. It can be used in a wide range of toys and other 
entertaining devices including squirt guns, greeting cards, pens. 

15 The rainbow proteins can also be used as an alternative to pholasin alone, resulting in a 
rainbow of colours and colour changes. 

EXAMPLE 18: Pholasin in trangenic animals 

20 Transgenic animals such as nematodes, mice or plants can be generated from apopholasin 
cDNA by standard techniques. Injecting the luciferin or incubating whole plant in it forms 
the active pholasin. Oxygen or its metabolites can then be detected, measured and imaged, 
in an intact organ, or from the whole organism. It can also be used in humans, in DNA 
therapy or diagnosis. 

25 

EXAMPLE 19: Apoprotein from the luminous squid Ommastrophes 

The use of apoprotein from the luminous squid Ommastrophes is substituted for 
apopholasin, and the methods of Examples 1 to 18, above, are carried out. 

30 

EXAMPLE 20: Apoprotein from the mollusc Rocellaria 



The use of the apoprotein from the mollusc Rocellaria is substituted for apopholasin, and 
the methods of Examples 1 to 18, above, are carried out. 
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EXAMPLE 21: Earthworm iuciferase 

The use of earthworm Iuciferase as a BOIP is substituted for apopholasin, and the 
methods of Examples 1 to 18, above, are carried out. 

EXAMPLE 22 

Genomic DNA from Pholas, Rocellaria, Ommastrophes, or earthworm is substituted for 
the recombinant protein in Examples 1 to 18, above, the methods of which are carried out 
in an analogous manner. 
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The present invention relates to a protein, capable of bioluminescence, cDNA coding 
therefor and their uses, inter alia, in diagnostics and therapy. In particular, this invention 
5 relates to the cloning and sequencing of cDNA coding for pholasin from the bivalve 
mollusc Pholas dactylus. 

The term 'bioluminescence' refers to the emission of light resulting from a chemical 
reaction within, or produced by, a living organism. The essential components to the 
10 ' chemical reaction are: an organic molecule, usually comprising a luciferin; oxygen or one 
of its metabolites; and an enzyme or luciferase that catalyses the oxidation of the luciferin. 
The chemiluminescent reaction responsible for bioluminescence may be represented as 
follows: 

15 Luciferin + 0 2 , or 0~ 2 , or H 2 0 2 , or OH or OCT or OX" or '0 2 (+ luciferase) > 

oxyluciferin + light. 

Up to three other substances may also be required to generate light or to generate light of 
the required colour and intensity. These are as follows: 



(a) A cation, such as H + , Ca 2+ , Mg 2+ or a transition metal cation {eg Cu /Cu , 
Fe 2 7Fe 3+ , La 3+ and V 3+ ); 

(b) A co-factor such as NAD(P)H, FMN or ATP; and/or 

(c) A fluor as an energy transfer acceptor. 

25 

Five chemical families of luciferin are known: 

(a) Aldehydes (found in the freshwater limpet Latia, earthworms, and with FMN in 
bacteria); 

30 (b) Imidazolopyrazines, which are the compounds most commonly responsible for 
bioluminescence in the sea (found in Sarcomastigophora, Cnidaria, Ctenophora, 
Annelida, Chaetognatha, some Arthropoda, some Mollusca and some Chordata); 

(c) Benzothiazoles (found in beetles such as fireflies and glow-worms); 

(d) Linear tetrapyrroles (found in dinoflagellates, euphausiid shrimp and some fish); 



20 



and 

(e) Flavins (found in bacteria, fungi, polychaete worms and some molluscs). 

Chemiluminescent reactions involving these luciferins may produce a glow or a flash with 
5 an emission of violet, blue, blue-green, green, yellow, orange or red light, or occasionally 
UV or IR light. The light emission may be linearly or circularly polarised. The luciferin 
or its product may also be detected and quantified by fluorescence or phosphorescence. 
As a chemical reaction is directly responsible for the light emission, it does not require 
exposure to UV, visible or IR light. However, some bioluminescent systems, such as that 
10 in the red organ of the deep sea fish Malacosieus, exhibit a photo-chemi luminescence, 
where light can trigger or enhance the chemiluminescent reaction. [Reference is directed 
to Chemiluminescence: Principles and Applications in Biology and Medicine, A K 
Campbell (1988), Horwood/VCH Chichester, Weinheim.] 

15 In the case of some bioluminescent proteins, the luciferin is so tightly or covalently bound 
to the protein molecule that it does not diffuse away into the surrounding fluid as a result 
of the chemiluminescent reaction. In this case, the protein-luciferin complex is known as 
a photoprotein; and the protein itself is referred to as an apophotoprotein. Some 
bioluminescent proteins are proteins whose light emission or radiation depends on or may 

20 be altered by oxygen or one of its metabolites; these bioluminescent proteins are 
hereinafter referred to as 'bioluminescent oxidative indicator proteins' (BOIPs).BOIPS 
may thus be photoproteins or luciferin-luciferase systems. 

BOIPs, therefore, may be used to detect and quantify oxygen or one of its metabolites in 
25 individual cells, defined compartments of living cells such as the nucleus, whole organs 
and organisms - both animals and plants, including microbes such as viruses and bacteria 
and protozoa - as well as substances of biological interest such as substrates, metabolites, 
vitamins, drugs, intra- and extra-cellular signals, enzymes, antigens, antibodies and 
nucleic acids. Heretofore, it has only been known to employ native BOIPs extracellularly. 

30 

The present invention therefore relates to a method for the detection and/or measurement 
of oxygen or one of its metabolites in live cells (intracellular), which method comprises 
providing a BOIP, such as native or chemically- (or genetically-) modified BOIP or a 
'rainbow protein' based on such a BOIP, intracellular^ and thereafter detecting and/or 



quantifying light em SIR therefrom and/or changes in coW^ intensity and/or 
polarisation of emission(s) therefrom. 

Furthermore, it has now been found that, by sequencing the BOIP and identifying the 
5 cDNA encoding therefor, the recombinant BOIP can also be used in such a method, or 
chemically- or genetically-modified recombinant BOIP, or a 'rainbow protein' based on 
such a BOIP. For example, the bivalve mollusc Pholas dactylus has been shown to 
comprise a native photoprotein, which interacts with a luciferase, when they are secreted 
together by the mollusc to produce light when O2 or one of its metabolites is present. 

10 References to the Purification and Properties of Pholas Dactylus Luciferin and Luciferase 
can be found by Michelson in Methods in Enzymology LVII 385-406 (1978). References 
to detection of activation of neutrophils by detection of superoxide anion can be found by 
Roberts in Anal Biochem 160 139-148 (1987) and by Muller et al in J Biolum Chemilum 
3 105-113 (1989). The native photoprotein (known as pholasin) is made up of a 

15 glycosylated apoprotein (34kDa) with a small organic molecule, the luciferin, tightly 
bound to it. This luciferin (whose structure is unknown - Muller and Campbell in J 
Biolum Chemilum 5 25-30 (1990)) can be extracted from the protein moiety - the 
apopholasin - or from the organism by a standard treatment, such as mild acid. The 
pholasin may be collected from live molluscs found in sedimentary rocks at low water 

20 along the south coast of England from Plymouth to Folkestone and also along the French 
channel coast and in the Mediterranean. Further details may be obtained from marine 
fauna and the references cited herein. 



We have surprisingly found that pholasin can generate light even without the presence of 
25 the corresponding luciferase by addition of oxygen metabolites such as 0 2 ~, H 2 0 2 , OC1- 
or other oxyhalide anions, or organic peroxides, and certain organic solvents such as 
dimethyl sulphide (DMSO) or dimethyl formamide (DMF). 

We have now identified the cDNA encoding for the (non-glycosylated) apoprotein of 
30 pholasin, which may also be called 'apopholasin'. Accordingly, the present invention 
therefore further provides an isolated, purified or recombinant nucleic acid sequence 
comprising: 

(a) The apophotoprotein of pholasin (alternatively, 'apopholasin'); 



(b) A sequence subSlmially homologous to or that hybridises^^equence (a) under 
stringent conditions; or 

(c) A sequence substantially homologous to or that hybridises under stringent 
conditions to the sequence (a) or (b) but for the degeneracy of the genetic code; or 

5 (d) An oligonucleotide specific for any of the sequences (a), (b) or (c). 

The present invention will now be further described with reference to the accompanying 
Figures, in which: 

10 Figure 1 (SEQ ID NOS 1-3) shows three different cDNAs encoding apopholasin, referred 
to as clones 40, 3 and 5. Nucleotides in bold type show codons used for initiation and 
termination of translation; 

Figure 2 (SEQ ID NOS 1-3) shows the three sequences of Figure 1 (SEQ ID NOS 1-3) 
15 aligned to demonstrate the sequence similarity. This figure was generated by Clustal. 
Positions which are indicated with a star are identical in all three clones. The codons for 
the initiation and termination of translation are highlighted in bold; 

Figure 3 (SEQ ID NO: 1) shows the oligonucleotides used for the complete sequencing of 
20 the positive clones. These were identified from the cDNA library; their positions in clone 
40 are shown. Oligonucleotides are shown in bold type, portions of the flanking sequence 
of the Bluescript plasmid are shown in italic type; 

Figure 4 (SEQ ID NOS 1 and 4) describes the protein sequence described by the DNA 
25 sequence coding for apopholasin and shows, in Figure 4A, the complete sequence (SEQ 
ID Nos 1 and 4) of the positive clone 40 identified from the Pholas dactylus light organ 
library. The first 20 amino acids at the N-terminus are a signal peptide, and this can either 
be retained or removed when generating the BIOP as described in this invention and, in 
Figure 4B (SEQ ID NO: 1), the cDNA coding for apopholasin with untranslated 5' and 3' 
30 ends. The untranslated regions are also shown; 



Figure 5 (SEQ ID NOS 5-6) describes the protein sequence for pholasin with (5B) (SEQ 
ID NO: 6) and without (5A) (SEQ ID NO: 5) the signal peptide; 



piece of SEQ ID NO: 1 and SEQ 



Figure 6 (SEQ ID NOP^ piece of SEQ ID NO: 1 and SEQ IWO: 23) shows the 
sequence for apopholasin genomic DNA. Two gDNA clones were indentifed but no 
introns were found; the Figure shows an alignment of the cDNA from cDNA clone 40 
(SEQ ID NO: 1) and the gDNA amplified by both xTth DNA polymerase XL (SEQ ID 
5 NO: 23) and BioXAct polymerase (piece of SEQ ID NO: 1). The sequences of the PCR 
product and the inserts in pGEM T were aligned with the sequence of the cDNA of clone 
40 and were identical to this cDNA; 

Figure 7 (SEQ ID NOS 7-22) describes the oligonucleotides used for screening and 
expression. Degenerate oligonucleotides for library screening are shown in Figure 7A; 
10 (SEQ ID NOS 7-10) non-degenerate ones in Figure 7B (SEQ ID NOS 7-16); and 
oligonucleotides used for protein expression are shown in figure 7C (SEQ ID NOS 17-22 
). 

Figure 8 lists the main restriction sites in the DNA for engineering pholasin; 
Figure 9 (piece of SEQ ID NO: 1) is a schematic representation of Figure 8 mapped to the 
1 5 sequence of Figure 4A (translated region), and 

Figure 10 is a time course of hypo-chlorite triggered chemiluminescence. 

Accordingly, the present invention provides recombinant DNA encoding the 
apophotoprotein apopholasin and comprising the nucleotide sequence of the sequence 

20 disclosed in Figure 4B (SEQ ID NO: 1). Three different cDNAs coding for apopholasin 
have been isolated, having differing non-coding regions, respectively disclosed in Figure 
1 (SEQ ID NOS 1-3). The genomic DNA (gDNA), which contains no introns, has been 
shown (Figure 6) (SEQ ID NO: 1, piece of SEQ ID NO: 1, SEQ ID NO: 23) to comprise 
the same basic sequence as the cDNA. 

25 Pholasin is a glycoprotein having 11.1 glusamine, 9.8 fructose, 7.1 mannose and 5.2 
galactose residues. The cDNA for apopholasin has a molecular weight of 23,456 
compared to 34,600 of the pholasin extracted from Pholas. The difference in the 
molecular weights of native versus recombinant apopholasin is due to the glycosylation of 
the native protein and luciferin. The isoelectric point of the translated protein calculated 

30 by the ISOELECTRIC command of the GCG programme is at 3.84. The native protein 
has a lower isoelectric point (<3.5), probably due to the presence of bound sulphate. 
The three clones (Figure 2) (SEQ ID NOS 1-3) isolated from the library encode a unique 
protein (Figures 4 and 5) (SEQ ID NOS 1 and 4-6), which does not have the same amino 
acid sequence as any known protein in the SwissProt database. The present invention 



therefore not only proWes cDNA and RNA coding for the^Ktein, but also the 
recombinant protein per se, with or without glycosylation units. A comparison of 
segments of the pholasin protein sequence with the proteins in the SwissProt database 
identified several proteins with regions having a high sequence similarity to regions of the 
5 cloned protein. These included several proteins which interact with nucleotides (Table 1). 



Table 1 A comparison of sections of the sequence of the cloned protein with sections 

of proteins which interact with nucleotides. 



Protein 


Homologous region 
cloned protein 

homology (+ denotes a conserved amino acid) 
selected protein 


tRNA-splicing endonuclease p 
subunit (piece of SEQ ID NO: 4) 
Saccharomyces cerevisiae 
EC 3.1.27.9 (SEQ ID NO: 24) 


SLYDEDNNGVMDEGKVI Pobl lb 
+L DEDNN + + G ++P E++E 
NLRDEDNNLLDENGDLLPLESLE 

LDQDVELDYTW 
LD DV DYTW 
LDHDVSKDYTW 


ATP-AMP transphosphorylase 
(piece of SEQ ID NO: 5)Cyprinus 
carpio 

EC 2.7.4.3 (SEQ ID NO: 25) 


VMDEGKVI PSETIEDDIKDCGLLDQDVELDY 
+M +G+++P +T+ D IKD + DV Y 
IMQKGELVPLDTVLDMIKDAMIAKADVbKGY 


DNA primase (piece of SEQ ID 
NO: 6) 

Synechocystis sp. 

EC 2.7.7. -(SEQ ID NO: 26) 


EEVQCAMNWTQANEYVFNVD 
++VQ M ++Q+ + + FN D 
DQ V Qb LMKfc bUbis.yl 1 r N r JJ 


purine permease (piece of SEQ ID 
NO: 6) 

Emericella nidulans (SEQ ID NO: 
27) 


VQCAMNWTQANEYV 
IMCSVDWTRRNRFI 


DNA repair protein complementing 
XP-A cells homologue (piece of 
SEQ ID NO: 5) 

Drosophila melanogaster (SEQ ID 
NO: 28) 


PDTVDEAEDTPSET 
PDT DE EDT + T 
PDTYDEEEDTYTHT 


ATP synthase (3 chain (piece of 
SEQ ID NO: 5) 
Peptococcus niger 
EC 3.6.1.34 (SEQ ID NO: 29) 


DTVDEAEDTPSET 
D +DEA + PSET 
DPIDEAGEVPSET 


DNA polymerase a (piece of SEQ 
ID NO: 6) 
Homo sapiens 


DEDNNGVMDEGKVI PSETIEDDIKD 
D+D G +++G+ I + +EDD D 
DDDGIGYVEDGREI FDDDLEDDALD 



EC 2.7.7.7 (SEQ ID NOPfD) 



Similarity was found between the Vargula iuciferase and Renilla LBP, but no other 
bioluminescent protein. 



5 Sequence homology between the cloned protein (piece of SEQ ID NO: 5) and (a) Vargula 
iuciferase (SEQ ID NO: 31) (b) Renilla LBP (SEQ ID NO: 32). An area of high 
homology in all three proteins is in bold print. 



10 (a) 148 206 
GTIWTVRVSLYDEDNNGVMDEGKVIPSETIEDDIKDCGLLD-QDVELDYTWTQNECDL 
V+VSL D+ + + T+ DID + V++ + + 

YWNTWDVKVSLRDVESYTEVEKVTIRKQSTWDLIVDGKQVKVGGVDVSIPYSSENTSI 
353 412 

15 (b) 

105 166 
STMPGTYMLMDVCATRDADDKCIEGTIWTVRVSLYDEDNNGVMDEGKVIPSETIEDDIKDC 

+ TR + VR+S+ + N+ K I 

AIKIAKLSAEKAEETRGFLRVADQLGLAPGVRISVEEAAVNATDSLLFCMKAEEKAMAVIQSL 
20 41 104 



Three potential glycosylation sites on the protein have the consensus triplet sequence Asn- 
Xaa-Ser/Thr (where Xaa can be any residue except proline). Thr 216 was identified as a 
potential site of O-linked glycosylation by a neural network which has been trained to 
25 identify this type of glycosylation. The amino acid sequence was also entered into a neural 
network which had been trained to identify eukaryotic signal peptides. This confirmed that 
the most likely cleavage site is between positions 20 and 21 (GSG-EE). 



Many families of proteins contain a "signature" sequence of amino acids. The sequence of 
30 the clones did not contain any of these signatures present in the PROSITE database. The 
amino acids from 170 to 185 correspond to the calcium binding consensus sequence 
[DENQST]X[DENQST]X[DENQST]X[DENQST]X[DENQST]XX[DENQST]. Thirteen 
potential phosphorylation sites were discovered that matched the consensus sequences for 
either the kinase phosphorylation site [RK](2)-x-[ST], the protein kinase C phosphorylation 



site [ST]-x[RK] or the cWm kinase II phosphorylation site [ST]-x(2t^DE]. 

Three N-linked glycosylation sites were identified in the translated sequence of the clones A 
neural network has been trained to identify this type of glycosylation which identified Thr 
5 216 as a potential site of O-linked glycosylation. At least one of these sites must be 
glycosylated in the native protein in order to account for the presence of the sugar residues. 
A putative signal peptide region preceded theN terminus of the secreted protein (determined 
by amino acid sequencing and was identified as a signal peptide by a neural network). To 
confirm this result the protein sequence was searched with PSORT for motifs which would 
10 locate the cloned protein in a cellular compartment. The protein sequence did not contain 
any transmembrane regions or N-myristoylation patterns which would indicate the presence 
of a lipid anchor. No targeting or retention sequences were found for the nucleus, 
mitochondria, endoplasmic reticulum or peroxisome. 



15 The fact that the clones had some sequence similarity with proteins that interact with 
nucleotides may suggest that pholasin binds a co-factor as part of the chemiluminescent 
reaction. Beetle luciferases require ATP binding for chemiluminescent activity. There is no 
P-loop binding motif ((A,G)x4GK(S,T) or (A)x{4}GK(T)) (SEQ ID NO: 33) in the amino 
acid sequence of these clones. However, not all ATP binding proteins contain this motif. 

20 Neither does the cloned protein contain the GXGXXG phosphate binding consensus 
sequence necessary for the binding of other co-factors such as nicotinamide adenine 
dinucleotide. 



The amino acid and sugar components of pholasin are not able to emit light at the 
25 wavelength of the native protein (490nm). This indicates that there must be a chromophore 
bound to the protein. There are, however, proteins in which the chromophore is composed 
of modified amino acid residues within the polypeptide. The best characterised of these is 
the green fluorescent protein (GFP). This has a chromophore which is a ring formed by the 
autocatalytic cyclisation of the residues Ser-dehydroTyr-GJy. The serine may be mutated to 
30 a threonine, which increases the amplitude of the emission at 488nm. Pholasin had no 
similar amino acid sequence. Putative luciferin binding regions have been identified for two 
bioluminescent chemistries. Aequorin has a putative coelenterazine binding region, which 
is also present in two sections of the Vargula hilgendorfii luciferase. The sequence of the 
cloned protein has no homology to the putative luciferin binding site of aequorin, but the 



region of the Vargula lcWKrase from residue 353 to 41 1 has some^Kilarity, as does the 
LBP of Renilla reniformis, which also binds an imidazolopyrazine. This may indicate that 
the chemistry of pholasin bioluminescence involves an imidazolopyrazine luciferin. 
However, the region of homology is very small. The beetle luciferases contain an area of 

5 low sequence homology which may bind the benzothiazole luciferin. This low homology 
may account for the different colours of beetle bioluminescence. used a luciferin analogue 
(2-(4-benzoylphenyl)thiazole-4-carboxylic acid which photoinactivated the luciferase active 
site of the firefly Photinus pyralis. This photoinactivation was directly linked to the 
degradation of a small peptide sequence HHGF (SEQ ID NO: 34) (residues 244-257). This 

10 is therefore postulated as a luciferin-binding site. The cloned protein does not have any 
sequence homology with these putative binding regions. Two strongly conserved regions of 
amino acids have also been found in both the luciferase and the luciferin binding protein of 
the dinoflagellate Gonyaulax polyedra. These regions were compared to the cloned protein, 
but no sequence similarity was found. No sequence identity could be established between 

1 5 the bacterial luciferases and the cloned protein. 



Therefore, the present invention provides cloned apophotoprotein apopholasin (and the 
cDNA coding therefor), which has identical properties to native (but non-glycosylated) 
apopholasin with respect to molecular weight, amino acid composition, potential for 
20 glycosylation, its highly acidic pi and its cellular location. Hence, the present invention 
can further provide the corresponding BOIP or modified BOIP, according to standard 
methods. 



The corresponding BOIP is preparable by bringing the apophotoprotein pholasin into 
25 association with the luciferin, also using standard methods. Although the luciferin is 
tightly bound in the native pholasin BOIP, it has been found that it may not be the case in 
the recombinant pholasin BOIP; indeed the luciferin may be weakly bound or merely 
present with the apoprotein. For example, a methanol, aqueous, acidic or other extract of 
Pholas dactylus (whole organism or light organ dissected from the animal) containing the 
30 'luciferin', or the pure luciferin, may be added to the solution, celt or organism (Figure 10 
shows the time course of hypo-chlorite triggered luminescence in these circumstances). A 
time course of apopholasin reactivation was performed by incubating partially purified 
recombinant pholasin secreted by insect cells with acid:menthanol extract of Pholas 
dactylus (•) for 0 (solid line), 1 (---), 2 ( ), 6( ) or 24 hours ( ), or incubated 



without extract (o). Co^ffols of buffers only with no protein or acWrnethanol extract (□) 

and extract alone (■) were treated in identical conditions. Chemi luminescence was 

triggered by the addtion of 2% soidum hypochlorite at 10 seconds (arrow) and is shown as 
chemiluminescent counts minus background light. A typical curve obtained by 
5 hypochlorite triggering of native pholasin is also shown (X). A representative experiment 
carried out in duplicate is shown. 

The luciferin associates with the apo-BOIP forming the photoprotein or remains loosely 
bound so that it turns over like a luciferase.. The luciferin on the photoprotein then reacts 
10 with oxygen or one of its metabolites to produce light, in the presence or absence of the 
luciferase. The light emission may be detected, quantified, or imaged using a 
luminometer, photographic film or imaging camera, or by the naked eye. Alternatively, 
light emission may be generated spontaneously by intra- or extra-cellular metabolites 
reacting with the apo-BOIP. 

15 

Although illustrated with respect to pholasin, the following may apply to any BOIP: the 
BOIP can be produced directly from native DNA, or from DNA engineered or amplified 
by the polymerase chain reaction. By this means, sites can be inserted within the protein 
by splitting the DNA into two or more pieces, or by adding DNA sequences to the 5' or 3' 
20 ends. For example, the DNA may be expressed in bacteria, yeast, an insect or human cell, 
or other suitable organism to produce protein which can be extracted and used. 



In this instance, the protein produced from the cloned DNA reacts with oxygen or a 
metabolite of oxygen, such as the superoxide anion (CV), hydrogen peroxide (H2O2), 

25 hydroxy radical (OH ), an oxyhalide anion (OCP, OBr\ Of, OSCN), nitric oxide (NO), an 
organic hydroperoxide or a radical ROO . The change in light emission enables the 
oxygen or metabolite(s) to be detected and quantified in live cells, organelles, or on the 
outer or inner surface of the plasma membrane, or within an organ of a live organism 
without the need to break them open or the need to separate bound and free fractions. 

30 This also enables an enzyme producing oxygen or one of its metabolites, such as 
chlorophyll, or enzymes such as oxidases and oxygenases which react directly with 
oxygen or one of its metabolites to attach oxygen to the substrate to be detected and 



quantified in live cells,^fans and whole organisms, or extracts fro^RTny one of these. 

Also the BOIP can be made in vitro by transcription/translation in a cell lysate such as 
rabbit reticulocyte lysate or wheat germ extract containing RNA polymerase. The DNA 
5 for the BOIP is first engineered to contain an RNA polymerase promoter, such as T7, 
SP6; bacterial promoter(s), such as lac, ara or trp; or mammalian promoters), such as 
actin, myosin, myelin proteins, TK, MRT-V, SV40, CMV, RSV, metallothionine, 
antibody, G6P dehydrogenase, and can be amplified in vitro using the polymerase chain 
reaction. A poly-A tail may be added at the 3' end and a tissue specific promoter or 
10 enhancer sequence added to the 5' or 3' end of the DNA coding for the BOIP or modified 
BOIP, enabling it to be expressed specifically in a target cell, such as a myocardial cell or 
a cancer cell. The expression of the BOIP in the target cell is detected and quantified by 
light intensity, colour or polarisation, as previously mentioned. 



15 The BOIP, or its DNA or RNA, may be incorporated into a live bacteria or eukaryotic cell 
using phage, virus, plasm id, calcium phosphate transfection, electroporation, liposome 
fusion, membrane pore forming proteins, micro-injection or DNA gun. Once inside cells 
or an appropriate extracellular environment, cell activation or injury will initiate or 
change the light emission from the BOIP. For example, expression in live organisms by 

20 micro-injection of protein, RNA or DNA, or by transgenic manipulation to produce a cell, 
such as a bacterial, microbial, animal or plant cell, eg a white blood cell, a heart cell, or a 
yeast, protozoan, fruit fly (Drosophila), nematode worm, polychaete worm, fish, human, 
mouse, rat, sheep, pig, horse or plant, which can generate its own light. 



25 

The BOIP can also be incorporated into a defined part of a live cell by chemical means or 
by genetically engineering the BOIP to contain a signal peptide, which locates the BOIP 
to the inner or outer surface of the plasma membrane or within a particular organelle, such 
as peroxisome, mitochondrion, chloroplast, tonoplast, endoplasmic, reticulum, Golgi 
30 apparatus, endosome, lysosome, secretory vesicle, nucleus, nucleolus, nuclear membrane, 
plasma membrane, proteosome, or gap junction, or structure such as membrane receptor 
ion channel microtubule, cytoskeleton, nuclear skeleton, nuclear receptor, mitotic spindle 
or microfilaments. The signal peptide, added either chemically or genetically, will 
normally target the normal or modified BOIP to a particular intra- or extra-cellular site. 



For example, the seqWhce MLSRLSLRLLSRYLL (SEQ ID^D: 35) or part of 
cytochrome oxidase on the N-terminus will target the BOIP to the mitochondrion; 
KKSALLALMYVCPGICADKE (SEQ ID NO: 36) or MLLPVPLLLGLLGLAA (SEQ ID 
NO: 37) at the N-terminus will target the BOIP to the endoplasmic reticulum, a KDEL 
5 (SEQ ID NO: 38) or HDEL (SEQ ID NO: 39) or KEEL sequence (SEQ ID NO: 40) at the 
C-terminus retaining it there. SKL at C-terminus targets BOIP to the peroxisome; 
PKKKRKV (SEQ ID NO: 41) or an extension of this SV40 large T-antigen signal will 
target it to the nucleus; and a palmitoylation and/or a myristoylation signal will target it 
to the plasma membrane. By coupling the BOIP to another protein that targets itself to a 
10 particular site, the BOIP can also be targeted there. For example, coupling the nuclear 
proteins nucleoplasm^ or lamin B receptor to BOIP targets it to the nucleus; cytochrome 
oxidase at the N-terminus targets BOIP to the mitochondria; chlorophyll at the N- 
terminus targets BOIP to the chloroplast; a connexin at the N-terminus targets BOIP to 
the gap junction or plasma membrane; and SNAP 25 to the plasma membrane. 

15 

Other modifications to the apoprotein, BOIP, or nucleotides coding therefor include, but 
are not limited to: 



The apoprotein, such as apopholasin, may also be glycosylated, and used to detect and 
20 quantify secretion or movement of proteins through the secretory pathway. 

Nucleic acid coding for the BOIP when expressed inside a live cell may not only be 
modified but also regulated in this cell by gene expression, such as by promoters, 
enhancers or oncogenes. For example, the apoprotein, such as apopholasin, may be 
25 coupled to a gene regulator protein, such as a transcription factor, by genetic or chemical 
manipulation, such that the movement through a cell or of the regulator protein or its 
activity, can be detected or quantified. 

The BOIP or apoprotein, or its DNA may be linked to another protein or DNA used in 
30 therapy, such that the other protein or DNA can be detected in live cells or in a whole 
organism, eg a human. 

The apoprotein, such as apopholasin, can also engineered genetically or chemically to 
contain a site or sites which can be covalently modified by enzymes such as 



10 



phosphorylation (irtcreWng ser/thr, his and tyr kinases^lnd phosphatases), 
tranglutamination, proteolysis, ADP ribosylation, gly-or glu-cosylation, halogenation, 
oxidation, methylation, palmitoylation, myristylation and farnesylation. 



The apoprotein, such as apopholasin, can be engineered genetically or chemically to 
contain an antigen or intracellular signal binding site, such as Ca 2+ , cyclic AMP, cyclic 
GMP, cyclic CMP, IP 3 , IP 4l diacyl glycerol, ATP, ADP, AMP, GTP, or any oxy- or 
deoxy-ribonucleoside or nucleotide, a substrate, a drug, a nucleic and/or a gene regulator 
protein. 



The BOIP may also be converted to a rainbow protein by engineering a particular site 
such as described hereinabove into the BOIP, at the N- or C-terminus, or between a 
chimera of the BOIP and an energy transfer acceptor, such as GFP (wild type or any of the 
mutant GFPs). This is known as chemiluminescence, bioluminescence or fluorescence 

15 resonance transfer (CRET, BRET or FRET, respectively). Conversion of the BOIP to a 
'rainbow protein' may be effected by reaction with a cellular substance, modification 
genetically or chemically, or by linking the BOIP to a fluor, such as the green fluorescent 
protein or the red fluorescent protein in the deep sea fish Malacosteus. The result is a 
BOIP which changes its colour and/or intensity and/or polarisation of emission. The 

20 change in colour occurs by energy transfer, eg resonance transfer (CRET or FRET) or 
electron transfer. 



The initial (unmodified) BOIP may be the apophotoprotein, its DNA or RNA, from the 
bivalve mollusc Pholas dactylus, or it may be another BOIP, such as one from the 
25 mollusc Rocellaria grandis or the squid Ommastraphes, or earthworm luciferase, which 
produce light with oxygen metabolites in a way very similar to Pholas dactylus. 

The BOIP, apo-BOIP, or nucleic acid coding for it, whether modified or not, may 
therefore be used in a range of biology and investigations such as: 

30 

(a) Detection, location and measurement of signals in substrates, such as live cells, 
organs or organisms, or in extracellular fluids; 

(b) Detection, location and measurement of oxygen and its metabolites in substrates, 
such as live cells, organs or organisms, or in extracellular fluids, water (sea and 
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fresh), soil or tfWffmosphere; 

(c) Detection and location of normal cells such as microbes (protozoa, yeast, fungi, 
moulds, bacteria, viruses); 

(d) Detection and location of abnormal cells, such as cancer cells, hyperactive cells in 
5 rheumatoid arthritis and other inflammatory diseases, cells infected with a 

pathogen, such as a virus or other infectious agents, cells damaged by physical, 
chemical or biological attack, cells damaged by perfusion or reperfusion injury or 
cells damaged by oxygen or one of its metabolites; 

(e) Measurement and location of enzymes, particularly those producing oxygen or its 
10 metabolites, and other tumour reactions in cells or biological fluids; 

(f) DNA and RNA binding assays; 

(g) Immunoassay and other protein binding assays; 

(h) In genetic engineering, in the development of transgenic animals and plants, and 
microbes; in horticulture; agriculture; medicine and veterinary medicine; and/or 

15 (i) in genetic entertainment by incorporation into light sticks, greeting cards or toys to 
produce light of various colour, intensities, oscillations, flashes and glows; or in 
comestibles, such as food, drinks, including beers, wines, spirits, colas and other 
soft drinks. 



20 

Accordingly, the present invention further provides an apoprotein, such as pholasin 
apoprotein (or apopholasin) in both unglycosylated and glycosylated forms, and a BOIP 
thereof, such as pholasin, either alone (but excluding native proteins per se that have 
already been isolated, such as native pholasin per se) or in association with one or more 

25 of: a targeting or signal peptide; a glycosylate; a site capable of modification by an 
enzyme; an antigen or intracellular signal binding site; a promoter, an enhancer or an 
oncogene or a pharmacologically active substance; or the like. The present invention 
further provides a recombinant construct comprising a nucleic acid sequence encoding for 
any of these proteins; a vector containing a nucleic acid sequence encoding for any of 

30 these proteins; a host transformed by such vector; a live cell, such as bacterial, insect, 
eukaryotic, prokaryotic, archae or plant cells containing or expressing any of these 
proteins; and a rainbow protein, as described herein, together with a nucleic acid 
sequence encoding therefor. 



The present invention vSWIow be illustrated with reference to the^K>wing non-limiting 
examples, in which the methodology referred to is known to those skilled in the art and/or 
may be carried out by analogy with reference to the protocols disclosed in the following 
references, the contents of which are herein incorporated by reference in their entirety: 
5 BOOKS 

1. Campbell,AK. (1988). Chemiluminescence: principles and applications in biology 
and medicine, pp608. Horwood/VCH, Chichester and Weinheim. 

2. Campbell,AK. (1994). Rubicon: the fifth dimension of biology, pp 304. Duckworth, 
London. 

10 PAPERS 

1. Campbell,AK Patel,A. (1983) Biochem J. 216:185-194. A homogeneous 
immunoassay for cyclic nucleotides based on chemiluminescence energy transfer. 

2. Roberts,PA Knight,J Campbell,AK. (1987) Anal. Biochem. 160:139-148. Pholasin - a 
bioluminescent indicator for detecting activation of single neutrophils. 

15 3. Mueller,T Davies,EV Campbell,AK. (1989) J. Biolum. Chemilum. 3:105-113. 
Pholasin Chemiluminescence Detects Mostly Superoxide Anion Released from 
Activated Human Neutrophils. 

4. MuellerJ Campbeli,AK. (1989) J. Biolum. Chemilum. 5:25-30. The Chromophore of 
Pholasin: A Highly Luinescent Protein. 

20 5. Sala-Newby,G and Campbell,AK (1991) Biochem.J. 279:727-732. Engineering a 
bioluminescent indicator for cyclic AMP dependent protein kinase . 

6. Campbell, AK, Trewavas, AJ and Knight, MR (1996). Calcium imaging shows 
differential sensitivity to cooling and communication in luminous transgenic plants. 
Cell Calcium 19:211-218. 

25 7. Kendall, JM, Sala-Newby, G Badminton, M, Campbell AK and Rembold, CR (1996). 

Free Ca 2+ in the endoplasmic reticulum of living cells measured using aequorin as a 
pseudo luciferase. Biochem.J 318:383-387 

8. Badminton, MN, Campbell, AK and Rembold, CR (1996). J.Biol. Chem. 271:31210- 
31214. Differential regulation of nuclear and cytosolic Ca 2+ in HeLa cells. 

30 9. Sala-Newby, GB, Taylor, KT, Badminton, MN, Rembold, CR and Campbell, AK 
(1998). Imaging bioluminescent indicators shows Ca 2+ and ATP permeability 
thresholds in, live cells attacked by complement Immunology. 93:4:601-609 

10. Sala-Newby, GB, Kendall, JM, Jones, H, Taylor, KM, Badminton, MN, Llewellyn, 
DH and Campbell, AK (1999). Targeting bioluminescent proteins to defined 
35 compartments fo living cells. Methods in Enzymology Bioluminescence and 
Chemiluminescence. ed Ziegler, M and Baldwin, T. 
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EXAMPLE 1: ProdiHHTn of a BOIP in bacteria 



c or genomic DNA coding for apopholasin, with or without the cDNA coding for the signal 
peptide, is amplified by PCR with restriction sites such as BamHl at each end. The cDNA is 
5 run on an agarose gel and the full length DNA eluted and purified. The DNA is then cut 
with BamHl to generate sticky ends and ligated into an expression plasm id such as pET3a, 
which has been cut with BamHl also. After ligation the sealed plasmid is transformed into a 
standard E.coli K12 strain such as JM109, a colony picked off for a large plasmid 
preparation. After checking that the plasmid contains the correct sequence for apopholasin 

10 and is in the correct orientation the plasmid is then used to transform a standard expression 
strain of E.coli such as BL21(DE3) or other expression strain. A colony is picked off the 
agar plate and grown up for 2h in standard LB broth. IPTG is added as inducer for a further 
2h. Apopholasin can then be extracted by breaking the bacteria by lysozyme digestion or 
sonication in a standard salt medium such as 50mM HEPES pH 7 +/- ImM ascorbate. Since 

1 5 the apopholasin is unglycosylated it tends to aggregate and form inclusion bodies. These 
can be broken using 8M urea or guanidinium chloride and these then dialysed off. If the pH 
of PAGE gels is alkaline this also tends to allow aggregation of both the unglycosylated and 
glycosylated apo- and full pholasin. A signal peptide such p-lactamase signal will target the 
BOIP to the periplasm ic space, resulting in the ability to secrete the expressed protein from 

20 the external fluid of the cells. 



EXAMPLE 2: Production of a BOIP in insect cells 

c or genomic DNA coding for apopholasin is inserted into a plasmid suitable for 
25 conversion into baculovirus when transfected into insect cells. Since pholasin is secreted 
by Pholas itself there is a signal peptide at the N-terminus. Removal of this by PCR will 
allow cytosolic expression in insect cells. If the signal peptide is left on or changed for 
honey bee mellitin signal peptide, the apopholasin is secreted into the external medium. 
The virus containing the DNA for apopholasin is then purified and stored until required. 
30 An aliquot is then added to fresh insect cells and these incubated for 3-7 days. The 
apopholasin is then isolated from the supernatant if a signal peptide is used, or from the 
cells is not. The apopholasin can then be purified by ammonium sulphate precipitation, 
gel filtration and DEAE chromatography. The state of glycosylation can be assessed by 
running the protein on PAGE when the molecular weight is 34Kda. Removal of the 



glycosylation by enzyr 
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"eturns the protein to the size of apopho 




23.5Kda. It can be 
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stored frozen or freeze dried, and activated to form pholasin by addition of luciferin as 
described in Example 3. 

Since the apopholasin tends to aggregate in the insect supernatant it is important to get the 
protein into non-aggregating buffer, e.g. 50mM HEPES pH 6, l-10mM ascorbate, as soon 
as possible. 

Formation of pholasin can then be achieved as described in Example 3. 
EXAMPLE 3: Generating pholasin and light emission 

To generate light the apopholasin must first be converted into pholasin with the luciferin. 
The luciferin can be extracted from native pholasin by mild acid, or by methanol, mild 
acid or alkaline treatment of light organs isolated from Pholas dactylus or the whole 
organism. After homogenisation the extract is centrifuged or filtered to remove 
particulate material. Further purification can be achieved by tic of hplc. The luciferin is 
best stored dry, but can be stored at -70°C. The intactness and concentration can be 
estimated by measuring the absorbance or fluorescence. The details are as follows: 

(a) Isolation of the luciferin 

Four protocols (1-4) have been developed to extract and isolate the luciferin 
responsible for light emission in pholasin. The luciferin is a small organic moiety 
tightly bound to apopholasin when pholasin is isolated from Pholas dactylus, but 
also can be found not bound to apopholasin. Thus the extraction procedure isolates 
either form of luciferin. 

1. The organism Pholas dactylus or its light organs are homogenised in 



50mM sodium phosphate pH 6.0 on ice. The pholasin is precipitated with 
saturated ammonium sulphate (4°C stirred), and then removed by 
centrifugation at ca 15,000g for 30min in the cold. The supernatant is then 
passed down a SEP-PAK silica column, which binds the luciferin. The 
column is washed with 5ml ethyl acetate and then 5ml of methanol. The 
active fractions containing the luciferin are assayed either by reactivation 
of the apopholasin or by chemituminescence in DMSO, DMF, or NaOCI. 



The luci 
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is concentrated and can be purified fu 




on tic or hplc with 



a standard solvent. It is dried and stored at -70°C. 

The organism Pholas dactylus or its light organs are homogenised in cold 
acetone on ice, filtered through a Buchner funnel, and extracted with 
methanol:acetone (1:1), the residual powder being extracted 3 times with 
methanol and extracts combined. These are then concentrated in a 
Rotavaporator and left to stand for Ih on ice to allow further precipitation. 
The suspension is then refiltered and concentrated. The solution containing 
the luciferin is then passed down a SEP-PAK silica column which binds 
the luciferin. The column is washed with 5ml ethyl acetate and then 5ml of 
methanol. The active fractions containing the luciferin are assayed either 
by reactivation of the apopholasin or by chemiluminescence in DMSO, 
DMF, or NaOCl. The luciferin is concentrated and can be purified further 
on tic or hplc with a standard solvent. It can be dried and stored at -70°C. 
The organism Pholas dactylus or its light organs are homogenised in cold 
acetone on ice, and filtered through a Buchner funnel to give an acetone 
powder. This is then extracted with methanoI:acetone (1:1), twice for 
lOmin and then 3 times with methanol. The extracts are combined and 
concentrated in a Rotavaporator. They are left to stand for lh on ice to 
allow further precipitation, refiltered and concentrated. The residual 
powder is resuspended in 50mM sodium phosphate pH 6.0, lOmM 
ascorbate, and ultrafiltered with a lOkD Amicon membrane at 4 C for 
pholasin. The solution containing the luciferin is then passed down a SEP- 
PAK silica column which binds the luciferin. The column is washed with 
5ml ethyl acetate and then 5ml of methanol. The active fractions 
containing the luciferin assayed either by reactivation of the apopholasin or 
by chemiluminescence in DMSO, DMF, or NaOCl. The luciferin is 
concentrated and can be purified further on tic or hplc with a standard 
solvent. It is dried and stored at -70°C. 

The organism Pholas dactylus or its light organs are homogenised in 
50mM HEPES buffer, with methanol and lOOmM HC1 on ice, and 
incubated for 2h on ice. After centrifugation at ca 1 5,000g for 30m in in the 
cold, the supernatant is then passed down a SEP-PAK silica column which 
binds the luciferin. The column is washed with 5ml ethyl acetate and then 



anol. The active fractions containing trrenuc 



5ml of nWRianol. The active fractions containing mWuciferin are assayed 
either by reactivation of the apopholasin or by chemiluminescence in 
DMSO, DMF, or NaOCl. The luciferin is concentrated and can be purified 
further on tic with a standard solvent. It is dried and stored at -70°C. 
5 Method 4 normally generates most luciferin. The luciferin is characterised by its 

absorbance and fluorescence spectrum, and by its chemiluminescence with 
DMSO, NaOCl and apopholasin. 
(b) Generation of pholasin from apopholasin and the luciferin 

A small sample of the luciferin (l-10ul) is added to apopholasin in an appropriate 
10 buffer (50mM HEPES pH 6-7.5, +/- 0.1% gelatine, +/- l-10mM ascorbate, or 

500mM NaCl, lOmM TES, ImM EDTA, ImM mercaptoethanol pH 6-7.5). The 
mixture is incubated at room temperature for up to 24h, and the pholasin assayed 
by adding an oxygen metabolite, e.g. NaOCl, or luciferase to a sample. When 
apopholasin has been expressed in cells, the luciferin is added externally, 
15 microinjected into individual cells or added via liposomes to get the luciferin into 

the cell. 



Light is detected and quantified in a standard luminometer, imaging camera 
(intensified or CCD), or by a silicon chip. 

20 

EXAMPLE 4: Production of a BOIP in vitro 

c or genomic DNA coding for apopholasin, with or without the signal peptide, is 
amplified by PCR with the 5' primer containing the DNA coding for T7 RNA 

25 polymerase. The DNA product is purified and precipitated. After dissolving in lOmM 
tris/lmMEDTA pH7, the DNA is added to a standard in vitro transcription/translation 
system such as rabbit reticulocyte lysate or wheat germ agglutinin and incubated at 30°C 
for 30-60min. The apopholasin can then be purified and activated to form pholasin as 
described in Example 3. 

30 EXAMPLE 5: Targeting a BOIP in vitro 

The BOIP can also be incorporated into a defined part of a live cell by chemical means or 
by genetically engineering the BOIP to contain a signal peptide which locates the BOIP to 
the inner or outer surface of the plasma membrane or within a particular organelle such as 



peroxisome, mitochon 




chloroplast, tonoplast, endoplasi 
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reticulum, Golgi, 



10 



15 



20 



25 



endosome, lysosome, secretory vesicle, nucleus, nucleolus, proteosome, or gap junction, 
or structure such as microtubule, cytoskeleton, nuclear skeleton, nuclear receptor, or 
mitotic spindle. The signal peptide, added either chemically or genetically, will normally 
target the normal or altered BOIP to a particular intra- or extra-cellular site for example, 
the sequence MLSRLSLRLLSRYLL (SEQ ID NO: 35) or part of cytochrome oxidase on 
the N-terminus will target the BOIP to the mitochondrion; 
KKSALLALMYVCPGKADKE (SEQ ID NO: 36) or MLLPVPLLLGLLGLAA (SEQ ID 
NO: 37) or the ER protein calreticulin at the N-terminus will target the BOIP to the 
endoplasmic reticulum, a KDEL (SEQ ID NO: 38) or HDEL (SEQ ID NO: 39) sequence 
at the C-terminus retaining it there. SKL at C-terminus targets BOIP to the peroxisome, 
PKKKRKV (SEQ ID NO: 11) or an extension of this SV40 large T-antigen signal will 
target it to the nucleus, and a palmitoylation and/or a myristoylation signal 
(MGCVCSSNPD (SEQ ID NO: 42) = the LCK N-terminal acylation motif from tyrosine 
kinase) will target it to the plasma membrane. By coupling the BOIP to another protein 
which targets itself to a particular site then the BOIP is also targeted here. For example, 
coupling the nuclear proteins nucleoplasm^ or lamin B receptor to BOIP targets it to the 
nucleus; cytochrome oxidase at the N-terminus targets BOIP to the mitochondria; 
chlorophyll at the N-terminus targets BOIP to the chloroplast; and a connexin at the N- 
terminus targets BOIP to the gap junction or plasma membrane, SNAP 25 to the plasma 
membrane. 

In order to target pholasin to defined sites in living cells, the DNA coding for these 
targeting sequences are added by using PCR. For cytosolic apopholasin the native signal 
peptide is removed and also the BOIP can be linked to larger proteins at the N- or C- 
terminus such as firefly luciferase or aequorin to prevent it getting into the nucleus. This 
also enables ATP and oxygen metabolites, or Ca 2+ and oxygen metabolites to be measured 
simultaneously in the same cells by intensity, colour or polarisation of the different 
bioluminescent indicators. A multiple bioluminescent indicator can also be engineered by 
PCR, or by using restriction enzyme sites, from the DNA coding for 3 or more 
bioluminescent proteins. A simple screen of the transformed bacteria enables the multiple 
rainbow protein to be isolated with 2-3 colours or more. 



The DNA is then addelWo an in vitro transcription/translation s^fem as described in 
Example 4 in the presence of the organelle to be targeted (e.g. microsomes for the 
endoplasmic reticulum, which glycosylate apopholasin), 



The new DNA can also be inserted into a plasm id by standard techniques, and 
transformed into bacteria or transfected or injected into eukaryotic cells such as HeLa or 
COS. 



Addition of the luciferin as described in Example 3 allows formation of pholasin which 
can then be detected by light emission. Changes in oxygen metabolite production are then 
be detected by a luminometer or imaging camera when the cells are exposed to external 
oxygen metabolites, a change in oxygen concentration, addition of stimuli e.g. TNF, EGF, 
hormones or drugs, or attack by pathogens such as bacteria, viruses, complement, 
antibodies, toxins, and cells of the immune system. 



EXAMPLE 6: Engineering a covalent modification site into a BOIP 

(a) The site coding a protein kinase A (RRAS or kemptide), protein kinase C 
(MARCKS), MAP kinase, ERIC, the ER - nuclear signalling kinase IRE IP or a 
phosphatase is added to the N- or C-terminus or inserted at various sites within the 
apopholasin by PCR and expressed as described in Examples 1-5. Pholasin is then 
generated by addition of the luciferin as described in Example 3. 

Addition of the catalytic subuntt for protein kinase A, or activation via cyclic 
AMP inside cells, leads to phosphorylation or dephosphorylation of the modified 
pholasin and change in light emission (intensity, colour or polarisation). 

A preliminary screen is necessary to select the appropriate proteins and to discard 
any which have lost all activity. 

(b) The site coding a protease (thrombin, enterokinase, HIV protease, caspase) is 
added to the N- or C-terminus of the apopholasin by PCR or inserted at various 
sites within the protein, and expressed as described in Examples 1-5. Pholasin is 
then generated by addition of the luciferin as described in Example 3. 
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EXAMPLE 7: Engineering a BOIP into a "Rainbow Protein" 

cDNA coding for apopholasin is linked to another protein by using the cDNA coding for 
5 that protein. For example, wild type GFP, the S65T mutant of the green fluorescent 
protein, YGFP, or EGFP are linked to apopholasin by PCR at the N- or C-terminus, or by 
splitting one or both proteins using multi-step PCR. In between there is a 'reactive' 
peptide with a protease site (a thrombin or enterokinase) and a binding site for IP3, or the 
15 amino acid sequence form IP3 kinase (an IP 4 binding site). At the C-terminus of the 

10 GFP, a peptide containing 6 lysine residues may also be added via PCR. The protein is 
expressed and fluorescein covalently linked to these lysines by addition of fluorescein 
isothiocyanate. Addition of the luciferin forms pholasin as described in Example 3. The 
change in colour occurs by chemiluminescence resonance energy transfer. Without 
fluorescein the rainbow protein emits blue-green light (508nm), which changes to blue 

15 (490nm) when the reactive substance binds to the reactive peptide, or when either 
thrombin or enterokinase is added. When the 6 amino acid linker is used the colour starts 
as green (530nm), and will then change from green, to blue-green and then blue as the 
particular reactive sequence binds their respective analytes. Use of rhodamine instead of 
fluorescein generates a rainbow protein which changes from red to green to blue. 



A preliminary screen is necessary to select the appropriate rainbow proteins and to discard 
any which have lost all activity. 

The other protein linked to apopholasin may be, for example, any one of the following 
25 linked chemically or genetically: 
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1. 



Firefly or any benzothiazole luciferase to the N or C terminus gives two colours 
for ATP and oxygen metabolites. 



30 



2. 



3. 



Any imidazolopyrazine luciferase, including coelenterazine systems - decapod 
shrimp, fish, sqiud, Renilla, anthzoan, Chaetognate, radiolarian, or copepod and 
Vargula systems - ostracod, Porichthys and similar fish, cypridinids and Vargula. 
Any tetrapyrrole luciferase such as dinoflagellate, euphausiid or stomiatoid fish. 



4. 



Bacterial luciferase and other aldehyde or flavin luciferases, including polychate 



worm. 



5. Any GFP, incliU wild type, S65T, enhanced GFP, bl^iFP, yellow GFP, 
Renilla GFP, Pilocarpus GFP, and Pennatula GFP, any anthozan GFP, or any 
coelenterate GFP. 

6. The red fluorescent proteins from stomiatoid fish - Malactosteus, Aristostomias, 
5 Photostomias. 

7. The phycobiliproteins - phycoerythrin and phycocyanobilin. 

8. The blue fluorescent lumazine protein in the bacterium Photobacterium. 

9. The yellow flavin fluorescent protein in Y Vibrio. 

1 0. Any lysine or argininine or other amino acid side chain where a fluor can be added 
10 covalently. In which the case the rainbow protein amy emiot more than two 

colours. For example, rhodamine on a pholasin-linker-GFP chimera will turn from 
red to green to blue. 



A preliminary screen may be necessary to select chimeras which have not lost all 
15 bioluminescent activity. 

The 'reactive' peptide may be a binding site for any analyte, protein or DNA, metabolite, 
substrate vitamin, an enzyme such a protein kinase C or phosphatase, ion channel, ion 
pump, antigen, antibody, nucleotide or nucleoside such as ATP, GTP, ADP, AMP, 
20 adenosine, cAMP, cCMP, cCCP or their deoxy equivalents, and inositol phosphates such 
as IP 3 or IP 4 , a lipid such as diacyl glycerol, phosphatidyl inositol bisphosphate, 
phosphate, a cation such as Ca 2+ , K + or Na + , Cu 2+ or Zn 2+ , or anion such as CP , sulphate, 
or gas such as NO, 0 2 or H 2 , or a protein binding site such as calmodulin, kinesin, dynein, 
tubulin, or myosin. 

25 

When pholasin is triggered by oxygen metabolites, the Pholas luciferase or peroxidase, 
energy transfer occurs from pholasin oxyluciferin through GFP to fluorescein resulting in 
a yellow emission. Addition of thrombin for 3h cleaves the GFP-fluorescein from the 
pholasin and the light emission returns to the blue of native pholasin. Addition of IP 3 to 
30 the full chimera alters the efficiency of energy transfer. As a result there is a change in the 
ratio of light emitted in the yellow to blue. This ratio is directly related and can be plotted 
against the concentration or amount of analyte. The light is detected in a dual wavelength 
luminometer or ratiometric imaging camera and the ratio of blue to green light measured. 



Alternatively any fluor: 




be used, and any binding sites with th 
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,t characteristics as 



shown in these examples will work provided a simple screen is used to select the right 
chimeras. 

EXAMPLE 8: Engineering a BOIP into a "Rainbow Protein" for two analytes 
together 

Apopholasin is linked to firefly luciferase by using cDNAs and PCR, followed by 
expression in insect cells as described in Example 2. Addition of the luciferin as 
described in Example 3 generates the pholasin. In the presence of firefly luciferin (ImM), 
ATP and oxygen metabolites, this chimera emits blue and yellow simultaneously which 
can be independently measured by using a dual wavelength luminometer or imaging 
camera. 

EXAMPLE 9: Expression of BOIPs in mammalian cells 

Apopholasin, c or genomic, in an expression plasmid with the CMV promoter, is 
transfected into HeLa cells. After incubation for 3 days to allow expression of the 
apopholasin, the luciferin is added to form pholasin. Expression is checked using a 
polyclonal antibody to pholasin raised in rabbits. Addition of oxygen metabolites outside 
the cell allows the permeability of the plasma membrane to oxygen metabolites to be 
assessed. As the oxygen metabolites permeate into the cytosol, the light emission 
increases. 



EXAMPLE 10: ExprSPGn of BOIPs in plants 



c or genomic DNA coding for apopholasin is inserted into a plasmid with the cauliflower 
mosaic virus promoter and transformed into Agrobacterium tumificans. These are then 
5 added to a tobacco leaf, seedlings generated, and those expressing apopholasin selected. 
The plants are grown to seed, and seedlings grown from this seed. Addition of luciferin 
forms the pholasin as described in Example 3. Stressing the plant, e.g. with wind, touch, 
cold, or peroxide, or during growth and development or by a hormone, generates light, 
showing the formation of oxygen metabolites within the live plant. A cell-specific 
10 promoter engineered on to the apopholasin cDNA before making the transgenic plant 
enables oxygen metabolites to be detected in specific cells within the whole, living plant. 

EXAMPLE 1 1 : Detection of oxidative damage in vitro 

15 Addition of pholasin to serum or plasma from a rat, mouse or human enables oxygen 
metabolites to be detected and measured on addition of a drug or other substance of 
interest. 

EXAMPLE 12: Detection of ROMs in a heart cells 

20 

Reperfusion has been proposed to lead to oxygen metabolite damage in cardiac myocytes. 
Pholasin allows this to be tested for the first time. Plasmid containing apopholasin cDNA 
and the CMV promoter is transfected into isolated cardiac myocytes in culture. 
Expression occurs within 1-3 days, and pholasin is formed by addition of the luciferin as 
25 described in Example 3. Subjecting the cells to hypoxia followed by readmission of 
normal oxygen leads to light emission, showing that oxygen metabolites have been 
generated inside the cells. By using an imaging camera, the digital or analogue nature of 
this can be assessed as the number of cells emitting light can be visualised and counted. 



30 
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EXAMPLE 13: DeteflMf of ROMs in the nucleus and endoplaWJc reticulum (ER) 




Plasm id-containing apopholasin cDNA with either nucleoplasmin DNA or calreticulin 
DNA (with or without KDEL on the C-terminus) linked to the pholasin DNA, to target 
5 the apopholasin to the nucleus or ER respectively, and the CMV promoter for expression, 
is transfected into HeLa cells in culture. Expression occurs within 1-3 days, and pholasin 
is formed by addition of the luciferin as described in Example 3. Addition of oxygen 
metabolites outside the cells, or hypoxic/oxygen shock generates light measured in a 
luminometer, showing how fast oxygen metabolites penetrate into these organelles. By 
10 imaging with a photon counting imaging camera, the number of cells permeable to 
oxygen metabolites can be counted. Location of the pholasin can be assessed by imaging 
live cells, or by using immunofluorescence with the pholasin antibody on partially-fixed 
cells or GFP-pholasin in live cells. Using a rainbow protein, two or more analytes can be 
detected together. 

15 

EXAMPLE 14: Use of pholasin as a protein label 

Pholasin can be used as a label in homogeneous or heterogeneous immunoassay. 
Apopholasin is first covalently linked to an antibody to HIV, and pholasin formed by 
20 addition of luciferin as described in Example 3. The antibody is then used in a standard 
chemiluminometric immunoassay format. Addition of HIV antigen leads to an increase in 
antibody binding and an increase in light emission dependent on the amount of HIV 
added. The amount of HIV in a blood sample can be assessed by relating the pholasin 
light emission in the sample to the standard curve. 

25 

EXAMPLE 15: Pholasin as a DNA label 



Apopholasin is covalently linked to an oligonucleotide probe for detecting the presence of 
the cystic fibrosis gene. Addition of the probe to DNA in a standard Southern blot allows 
30 the probe to bind when the gene is present. Addition of luciferin as described in Example 
3 allows the pholasin to form. Addition of hypochlorite (lOmM) in barbitone buffer pH 9 
causes the pholasin to flash and the gene can be visualised by the photon counting 
imaging camera. 



EXAMPLE 16: PholafBTin a two hybrid system 



Protein-protein interaction can be detected by engineering apopholasin to one half of a 
two hybrid system and GFP to the other. Binding will allow the yeast to grow. 

5 

EXAMPLE 17: Pholasin in genetic entertainment 

Pholasin is able to chemiluminesce at a wide range of pH (3-10), including acid pH such 
as 3-4. Thus it can be added to drinks such as beer, cola, soft drinks, and spirits to make 
10 them glow. It can also make food glow by adding to them to the ingredients of cakes, 
icing, popcorn; by painting the pholasin or apopholasin on to the food, or by making it 
genetically in the source of the food. It can be used in a wide range of toys and other 
entertaining devices including squirt guns, greeting cards, pens. 

15 The rainbow proteins can also be used as an alternative to pholasin alone, resulting in a 
rainbow of colours and colour changes. 

EXAMPLE 18: Pholasin in trangenic animals 

20 Transgenic animals such as nematodes, mice or plants can be generated from apopholasin 
cDNA by standard techniques. Injecting the luciferin or incubating whole plant in it forms 
the active pholasin. Oxygen or its metabolites can then be detected, measured and imaged, 
in an intact organ, or from the whole organism. It can also be used in humans, in DNA 
therapy or diagnosis. 

25 

EXAMPLE 19: Apoprotein from the luminous squid Ommastrophes 

The use of apoprotein from the luminous squid Ommastrophes is substituted for 
apopholasin, and the methods of Examples 1 to 1 8, above, are carried out. 

30 

EXAMPLE 20: Apoprotein from the mollusc Rocellaria 

The use of the apoprotein from the mollusc Rocellaria is substituted for apopholasin, and 
the methods of Examples 1 to 18, above, are carried out. 
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EXAMPLE 21: Earthworm luciferase 

The use of earthworm luciferase as a BOIP is substituted for apopholasin, and the 
5 methods of Examples 1 to 1 8, above, are carried out. 



EXAMPLE 22 

10 Genomic DNA from Pholas, Rocellaria, Ommastrophes, or earthworm is substituted for 
the recombinant protein in Examples 1 to 1 8, above, the methods of which are carried out 
in an analogous manner. 
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I. Basis of the report 

1 . This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to 
the report since they do not contain amendments (Rules 70. 16 and 70.17).): 

Description, pages: 

1-28 as originally filed 

Claims, No.: 

1 .30 as received on 07/1 2/2000 with letter of 07/1 2/2000 

Drawings, sheets: 

1/1 4-1 4/14 as originally filed 

Sequence listing part of the description, pages: 

1-9, filed with the letter of 16.02.2000 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 
E furnished subsequently to this Authority in written form. 

S furnished subsequently to this Authority in computer readable form. 

[3 The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 

the international application as filed has been furnished. 
S The statement that the information recorded in computer readable form is identical to the written sequence 

listing has been furnished. 

4. The amendments have resulted in the cancellation of: 
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□ the description, 

□ the claims, 



pages: 
Nos.: 



□ the drawings, 



sheets: 



5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1 . The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non- 
obvious), or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 

H claims Nos. 1 ,4,5,9-1 1,13-19,27-29 (all partially); 30 (completely). 



□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 

El the description, claims or drawings {indicate particular elements beloW) or said claims Nos. 
1,4,5,9-1 1 ,13-19,27-30 are so unclear that no meaningful opinion could be formed {specify): 
see separate sheet 

C3 the claims, or said claims Nos. 1 ,4,5,9-1 1 ,13-19,27-30 are so inadequately supported by the description that 
no meaningful opinion could be formed. 

□ no international search report has been established for the said claims Nos. . 

2. A meaningful international preliminary examination report cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



because: 
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1. Statement 



Novelty (N) 



Yes: 
No: 



Claims 
Claims 



I- 10,14,16,17 

II- 13,15,18-29 



Inventive step (IS) 



Yes 
No: 



Claims 
Claims 



1-10,14,16,17 



Industrial applicability (IA) 



Yes 
No: 



Claims 
Claims 



1-20,28,29 
21-27 



2. Citations and explanations 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
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Re Item III 

Non-establishment of opinion with regard to novelty, inventive step and 
industrial applicability 

No meaningful opinion could be formed on claim 1(b)(c)(d) and on claims 4, 5, 9, 
10, 15-17 and 27-29 referring thereto since the scope of these claims is 
completely unclear. 

In claim 1 (b), the term "substantially homologous" is unclear and no clear 
definition is given in the description. It should also be noted that an unclear term 
cannot be used to distinguish the claimed nucleic acid from known nucleic acids 
encoding luciferin-binding proteins. 

Claim 1 (c) is even more unclear than (b) since it defines a sequence by its 
homology or hybridization to the sequence (b) which is also not clearly defined. 
Claim 1 (c) relates to sequences which are homologous or hybridize to sequences 
which are homologous or hybridize to sequences (a). It is thus completely 
impossible for the skilled reader to known whether a given nucleic acid sequence 
is encompassed by the claim. The same applies to claim 1(d) which completely 
lacks technical features. 

Furthermore, no meaningful opinion could be formed on the "mutant or variant ... 
capable of binding to luciferin" of claim 1 1 , since these terms are unclear and 
would appear to encompass at least any known luciferin-binding protein. 

Claim 30 is completely unclear due to the lack of technical features. 

Consequently, the present opinion has only been established for claim 1(a), 2, 3, 
6-8, 12, claims 4, 5, 9, 10, 15-17, 27-29 only as far as they refer (directly or 
indirectly) back to claim 1(a), claim 1 1 only as far as it relates to apopholasin itself, 
claims 13, 14, 18, 19, 27 and 28 as far as they refer to this part of claim 11, and 
claims 20-26. 
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Re Item V 

R ason d statement und r Article 35(2) with regard to novelty, inv ntiv step or 
industrial applicability; citations and explanations supporting such statement 

1 . The following documents are cited: 

D1: ROBERTS, P. ET AL, ANAL BIOCHEM., vol. 160, no. 1, 1987, pages 139- 
148 

D2: COTTON, B. ET AL. f BIOCHEMICAL SOCIETY TRANSACTIONS, vol. 17, 

no. 4, 1989, pages 705-706 
D3: MULLER, T. ET AL, J. BIOLUMIN. CHEMILUMIN., vol. 3, 1989, pages 105- 

113 

D4: DATABASE GENBANK [Online] 10 February 1999 (1999-02-10) DUNSTAN, 
S. L. ET AL.: 'Cloning and expression of the photoprotein pholasin from the 
bivalve Pholas dactylus; a bioluminiscent indicator of reactive oxygen 
species.' 

2. The current assessment is based on the assumption that all claims enjoy priority 
rights from the filing date of the priority document. If it later turns out that this is 
not correct, the document D4 cited in the international search report could 
become relevant. 

3. The present application does not satisfy the criterion set forth in Article 33(1 )(2) 
PCT because the subject-matter of claims 11-13, 15 and 18-29 is not new. 

The present application relates to the cloning and sequencing of cDNA coding for 
the bioluminescent indicator protein pholasin from the bivalve mollusc Pholas 
dactylus . The native pholasin is made up of a glycosylated protein of 34 kD and a 
small organic molecule, the luciferin. The present application refers to this 34 kD 
glycoprotein as the apoprotein, whereas certain prior art documents use a less 
precise terminology refer to it as pholasin (see for instance D2, page 705, column 
2, lines 7-9: "Pholasin (...) is a 35 kD acidic glycoprotein" or D3, page 112, column 
1, paragraph 2: "Pholasin is a chemiluminescent 34 kD glycoprotein". 
Irrespective of the terminology used, the glycoprotein of pholasin and its 
purification from Pholas dactylus are well-known from the prior art, see each of the 
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documents D1 to D3. Moreover, the use of pholasin in the intra- and extracellular 
detection/measurement of reactive oxygen metabolites is also known from these 
prior art documents. Therefore, claims 11-13, 15 and 18-29 lack novelty. It should 
be noted that the reference to a known protein being produced recombinantly 
does not establish its novelty. 

It should furthermore be noted that the cells and organisms according to claim 1 5 
are not clearly distinguished from naturally occurring Pholas dactylus and cells 
thereof. 

4. The present application does not satisfy the criterion set forth in Article 33(3) PCT 
because the subject-matter of claims 1-10, 14, 16 and 17 does not involve an 
inventive step. 

Claims 1-10, 16 and 17 relate to nucleic acids encoding the known pholasin 
protein. However, at the present priority date, the cloning and sequencing of a 
known protein were routine tasks. A person skilled would therefore have 
reasonably expected that he would be able to clone and sequence the gene 
encoding the pholasin protein. Moreover, it appears from the description that this 
task did not pose such problems as to prove that the expectation of success was 
ill-founded. Therefore, the claimed subject-matter cannot be considered to involve 
an inventive step. 

Claims 4 and 14 do not involve an inventive step since the expression of 
recombinant proteins in prokaryotes such as E. coli , resulting in non-glycosylated 
proteins, is standard in the field. 

5. For the assessment of the present claims 21-27, as far as they relate to in vivo 
methods, on the question whether they are industrially applicable, no unified 
criteria exist in the PCT Contracting States. The patentability can also be 
dependent upon the formulation of the claims. The EPO, for example, does not 
recognize as industrially applicable the subject-matter of claims to the use of a 
compound in medical treatment, but may allow, however, claims to a known com- 
pound for first use in medical treatment and the use of such a compound for the 
manufacture of a medicament for a new medical treatment. 
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Re Item VIII 

Certain observations on the international application 

6. The subject-matter of claims 6, 7 and 20-26 extends to photoproteins other than 
pholasin, and is therefore neither enabled (Article 5 PCT) nor supported (Article 6 
PCT) by the description. 

7. It is not entirely clear in claims 6 and 7 what the claimed "constructs" should be 
(Article 6 PCT). It would appear that they should be nucleic acids, and although 
claims 1 , 6 and 7 have been drafted as separate independent claims, they appear 
to relate effectively to the same subject-matter and to differ from each other only 
with regard to the definition of the subject-matter for which protection is sought 
and in respect of the terminology used for the features of that subject-matter. The 
aforementioned claims therefore lack conciseness. Moreover, lack of clarity of the 
claims as a whole arises, since the plurality of independent claims makes it 
difficult, if not impossible, to determine the matter for which protection is sought, 
and places an undue burden on others seeking to establish the extent of the 
protection. Hence, these claims do not meet the requirements of Article 6 PCT. 
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